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Preface 


Since the discovery of the DNA structure, researchers have been highly interested in 
the molecular basis of genome inheritance. This book covers a wide range of aspects 
and issues related to the field of DNA replication. The basic process of DNA 
replication is highly conserved among all domains of life. To sustain genetic stability 
the cell has to ensure that entire genome is replicated exactly once and only once per 
cell cycle. However, modifications of the cell cycle leading to genome multiplication 
occur in the animal cells during polyploidization of trophoblast cells in mammalian 
placenta (reviewed by Zybina and Zybina). On the other hand, meiotic DNA 
replication reduces a diploid cell to four haploid gametes. Stuart in his chapter 
describes numerous features that distinguish regulation and progression of meiotic 
DNA replication from DNA replication during mitotic proliferation, connecting DNA 
replication and homologous recombination. 

Several chapters are dealing with viral DNA replication. Isomura points to regulation 
of expression of human cytomegalovirus DNA polymerase processivity factor as a link 
of viral DNA replication and transcription. Successful development of new 
approaches for antiviral therapy necessitates better comprehension of molecular 
mechanisms that regulate viral DNA replication fidelity (chapters by Matamoros et al., 
Hwang). 

The DNA repair is one of the most important genome surveillance systems of the cell 
and DNA replication is an integral part of all mechanisms for the repair of DNA 
damage. Members of repair family of proteins are emerging as essential components 
linking DNA damage recognition to cell-cycle checkpoints (Her, Xu and Wu). In his 
chapter, author McCabe summarized mechanisms of DNA repair with focus on 
biochemical activity of polymerases, while relationship between the processes of DNA 
synthesis and recombination is discussed in chapter by Zavec. 

Insights into the process of the protein-primed replication mechanism as one of the 
strategies for management the end-replication problem of linear genomes is 
describedin chapter by Salas and de Vega. 

Two chapters are addressing tissue-specific regulation of DNA replication. Current 
molecular understanding of DNA replication with a focus on developmental-stage and 
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tissue-specific regulation in the animal model Caenorhabditis elegans is presented in 
chapter by Ruijtenberg and Heuvel and The, whereas Czaja and coworkers discuss 
possibility of DNA replication in the adult mammalian neural tissue. 

Presence of proteins implicated in formation of prereplication complex could be the 
first sign of cells intention to proliferate and their use as novel proliferative markers is 
reviewed in chapter by Karimi. 

DNA replication is tightly coordinate with other cellular processes and it's not 
surprising that proteins involved in chromosome replication also has additional role in 
cell life, like SeqA regulation of transcription (Amine et al.). 

This volume outlines our current understanding of DNA replication and related 
cellular processes, and gives insights into their potential for clinical application. 


Dr. Jelena Kusic - Tisma 

Laboratory for Molecular Biology, 
Institute of Molecular Genetics and Genetic Engineering, 

Belgrade, 

Serbia 
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Mini-Chromosome Maintenance Protein Family: 

Novel Proliferative Markers - 
The Pathophysiologic Role 
and Clinical Application 

Shirin Karimi 1 and Makan Sadr 2 

1 Shahid Beheshti University of Medical Science 
faculty of Medicine, Tehran University of Medical Science, Tehran, 

Iran 


1. Introduction 

Proliferation markers are among the most important biologic markers in the pathogenesis of 
many benign and malignant tumoral lesions and also some non-neoplastic diseases. 
Extensive studies have been conducted on this matter shedding light on the role of these 
markers in the pathogenesis of many of these lesions and their contribution to standard 
diagnostic protocols, determination of prognosis and even treatment monitoring of diseased 
cases. 

Cancer is among the major causes of morbidity and mortality worldwide. Determination 
and recognition of biomarkers that detect cancer in its early stages, monitor the disease 
progression or work as a specific marker for disease prognosis can boost our ability in 
confronting such conditions and improve cancer patients' care by creating a personalized 
medicine for them. Assessment of the cell growth or proliferative signature of tumoral 
lesions is among the main parameters in recognition of the biologic course of cancer, 
prognosis and evaluation of the treatment course. 

At present, we focus on recently introduced proliferative markers; MCM protein family, 
their basic biologic role and short review of the clinical application. 

2. Cell cycle and proliferative markers 

Cell proliferation is a precisely supervised process initiated and controlled by a large 
number of molecules and interrelated pathways. Cell proliferation is induced and started 
by the act of growth factors. A controlled sequence of events take place sequentially for 
duplication and division of cell DNA during a process called cell cycle. The cell cycle 
consists of four distinct phases: Gi phase (pre-synthetic), S phase (DNA synthesis), G 2 phase 
(premyotic) and M phase (mitosis). Quiescent phase or GO is a resting phase where the cell 
has left the cycle and has stopped dividing (1). Replication of the genomic DNA should be 
completed before the onset of mitosis and is performed once in every cell cycle. 

Diagram of the cell cycle: 
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Several antigens are expressed during a cell cycle the oldest of which being Ki67 antigen. 
Some important antigens related to cell cycle were discovered later including PCNA, KiS2 
and MCM. 

Ki67 antigen was discovered by a German group of scientists (2) in early 1980s and 
identified by using mice monoclonal antibodies against a nuclear antigen from Hodgkin's 
lymphoma cell line. This antigen is a non-histone protein. The name is derived from the city 
of origin (Kiel, Germany) and the number of the original clone in the 96-well plate (2). Ki67 
antigen has different expressions during various phases of cell cycle. Cells express this 
antigen in Gl, S, G2 and M phases but they lack it in GO phase. Concentration of Ki67 is low 
in Gl phase and reaches its peak during S phase. Ki67 is down-regulated during anaphase 
and telophase. Various studies on cell cycle analysis have shown that Ki67 antigen is not 
expressed in early Gl phase. Several antibodies are routinely being used for detection of 
Ki67 in paraffin embedded tissue samples using immunohistochemistry. At present, Ki67 
index score is routinely employed showing tumoral cells exhibiting nuclear staining. Use of 
Ki67 as a diagnostic and prognostic marker in many neoplasms has been extensively studied 
and its role in standard biological evaluation of the clinical course and management of 
cancers among them Lymphomas and breast cancers has been well recognized (3-6). 

MCM and cell cycle: 

Numerous proteins have been recognized to play a role in initiation of DNA replication 
which mainly include Origin Recognition Complex (ORC) and MCMs (7). Prokaryotes lack 
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MCM proteins and only eukaryotes possess this special type of molecules. However, some 
related proteins have been found in some prokaryotes like Archaea (8-10). 
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Fig. 1. Phylogenetic tree of eukaryotic MCMs, assembled using ClustalX (ftp://ftp-igbmc.u- 
strasbg.fr/pub/ClustalX/) and Phylip 3.6 

(http://evolution.genetics.washington.edu/phylip.html) for Macintosh. Colors correspond 
to the seven MCM subfamilies. Dashed line, loose relationship. Accession numbers are as 
follows. S. pombe: SpMcm2, CAB58403; SpMcm3, P30666; SpMcm4, P29458, SpMcm5, 
CAA93299 and CAB61472; SpMcm6, CAB75412; SpMcm7, 075001. S. cerevisiae: ScMcm2, 
NP_009530; ScMcm3, NP_010882; ScMcm4, S56050; ScMcm5, A39631; ScMcm6, NP_011314; 
ScMcm7, S34027. Human: HsMcm2, P49736; HsMcm3, P25205; HsMcm4, NP_005905; 
HsMcm5, AAH03656; HsMcm6, NP_005906; HsMcm7, P33993; HsMcm8, NP_115874. 
Xenopus: Xmcm2, JC5085; Xmcm3,151685; Xmcm4, T47223; Xmcm5, PC4225; Xmcm6Z, 
AAC41267; Xmcm6, T47222; Xmcm7, T47221. Arabidopsis: AtMcm2, NP_175112.1; AtMcm3, 
NP_199440.1; AtMcm4, NP_179236.2; AtMcm5, NP_178812.1; AtMcm6, NP_680393.1; 
AtMcm7, NP_192115.1; AtMcm8?, NP_187577.1; unknown Mem, NP_179021.1. Drosophila: 
DmMcm2, AAF54207; DmMcm3, NP_511048.2; DmMcm4, S59872; DmMcm5, NP_524308.2; 
DmMcm6, NP_511065.1; DmMcm7, NP_523984.1. [11] 
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MCM proteins were first recognized in early 1980's in Bik-Kwoon laboratory because of 
their role in maintenance of plasmids and mini chromosomes in Saccharomycess Cervisiae 
proliferative cells (11). 

The MCM protein family is named for the genetic screen in budding yeast from which the 
founding members were originally isolated. They were defective in minichromosome 
maintenance, showing a high rate of loss of plasmids that contained a cloned centromere 
and replication origin [12,13]. 

These proteins play a role in the formation of prereplicative complex in G1 phase. By doing 
so, they license the chromatin for replication in the next phase of S (14). 

The MCM family of proteins is considered the key factor for initiation of replication 
regulation through cyclical DNA unwinding (14). Also, they play a role in condensation, 
cohesion, transcription and recombination (11). These proteins mainly include 6 major 
groups of MCM2 to MCM7 (11). In addition, 4 proteins of this family have been recognized 
to have independent function from the previously mentioned group including MCM1, 
MCM10, MCM8 and MCM9. It seems that the latter group of proteins only exists in multi¬ 
cellular organisms and higher eukaryotes. 

Although MCM1 and MCM10 belong to this family name wise, they do not have much in 
common with MCM2-MCM7. MCM1 is a transcription factor and does not have a direct role 
in DNA replication (15,16). 

MCM10 associates with MCM2-7 hexamer in the active replisome and helps to stabilize 
DNA polymerase a-primase [Reviewed in 17]. 

MCM8 has been reported in vertebrates and Drosophila, but not in fungi and nematodes, 
and although it retains some sequence similarities in the Walker B and R-finger, its Walker 
A ATPase motif contains sequences more like the canonical ATPases. [18] Intriguingly, 
while human MCM8 shares all the classic MCM features including a putative zinc finger and 
the IDEKFM and arginine finger motifs, it is the only MCM that has a classic GKS motif in 
its Walker A sequence. It is widely expressed in a variety of tissues and may not be restricted 
to proliferating cells [19, 20]. The protein is found in the nucleus, apparently chromatin 
associated during S phase [19]. 

MCM9 is also found in similar organisms with the exception that it is missing in Drosophila, 
and it is unique to the family in that it lacks the carboxy-terminal ATPase domain including 
the Walker B motif. [18] MCM9 mRNA was up-regulated by transcription factor E2E1 and 
serum stimulation in NIH3T3 cells [21]. 

Various members of this family have been studied in all eukaryotes by genetic and 
biochemical methods and it has been demonstrated that MCM2-MCM7 proteins have been 
present in the genome of all the studied eukaryotes and have not been subject to gene loss or 
functional replacement during evolutionary diversification of eukaryotes. 

In Drosophila, MCM4 corresponds to the gene disc proliferation abnormal [22], while in 
Arabidopsis, MCM7 is PROLIFERA [23], stressing their role in cell division. Human MCM2 
(BM28) was first identified as a nuclear protein [24], and human MCM3 (PI) was isolated as 
a DNA polymerase alpha-associated protein [25]. 

Unusual MCMs 

Unusual MCMs have been recognized during the course of various studies. For example, at 
present it has been found that some yeasts possess MCM6. However, some variants i.e. the 
zygotic form of MCM6 have been detected in Xenopus. Also, some variants of MCM4 have 
also been found (26). It seems that these variants are a substitute for normal MCM when 
adequate growth conditions are met. 
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3. DNA replication and MCM2-7 family proteins 

Prior to DNA replication and during late M and G1 phases of the cell cycle, MCM2-7 form 
the pre-replication complex (pre-RC) by being loaded on to the origin recognition complex 
(ORC) at the origin of replication. This is activated at the Gl-S transition of the cell cycle by 
the assembly of further protein components. [27] Only MCM2 and MCM3 have identifiable 
nuclear localization sequences (NLS), leading to an early suggestion that these MCMs 
provide nuclear targeting to the other members of the family [28]. In nearly all species, the 
bulk of MCMs are constitutively located in the nucleus throughout the entire cell cycle, with 
their chromatin association, rather than nuclear localization, subject to cell cycle regulation 
[24, 29-37]. However, there is still a role for the nuclear envelope in MCM complex assembly. 
This has been molecularly characterized using mutational analysis with the yeasts. 

MCM core is a trimeric complex that forms during purification in result of binding MCM4, 
MCM6, and MCM7 subunits tightly together. MCM2 binds to the core, but with decreased 
affinity. MCM3 and MCM5 form a dimer together and bind most weakly to the other 
MCMs, probably through MCM7 (Figure 2). [11] In the absence of other MCMs during in 
vitro reconstitution experiments, the MCM4,6,7 core will itself dimerize to form a dimer- 
trimer (MCM4,6,7)2, which is disrupted by addition of MCM2 [38-40]. 

All MCM members belong to the AAA+ ATPase family, which has a distinct ATPase 
domain that spans ~200 bases. This domain, referred to as the MCM box, consists of a 
Walker A ATPase motif, a Walker B ATPase motif, and an arginine finger motif (R-finger). 
Conserved sequences within the Walker B motif (IDEFDKM) and R-finger (SRDF) define the 
MCM family. Six of these members are conserved in all eukaryotes and form a 
heterohexameric complex known as MCM2-7, which has been studied extensively for its 
role in DNA replication. MCM2-7 is required for licensing and initiating origins of 
replication, and it acts during elongation as a helicase at the replication forks. Because of this 
function and studies in yeast, Arabidopsis and Drosophila, members of the MCM2-7 
complex, are thought to be essential [41]. 

The assessment of other multiple functions is consistent with studies in yeast, which showed 
that MCM proteins are far more abundant than would likely be required for the number of 
replication origins that exist, and this abundance cannot explain the fact that slight decreases 
in amounts of MCM proteins lead to the inability to complete S-phase and progress through 
the cell cycle [41]. 

Early data led to the identification of MCMs as central players in the initiation of DNA 
replication. More recent studies have shown that MCM proteins also function in replication 
elongation, probably as a DNA helicase. This is consistent with structural analysis showing 
that the proteins interact together in a heterohexameric ring. However, MCMs are strikingly 
abundant and far exceed the stoichiometry of replication origins; they are widely distributed 
on unreplicated chromatin. Analysis of MCM mutant phenotypes and interactions with 
other factors has now implicated the MCM proteins in other chromosome transactions 
including damage response, transcription, and chromatin structure. These experiments 
indicate that the MCMs are central players in many aspects of genome stability [11]. 

This family of proteins has been studied for interaction with other genes like Rb gene. 

4. MCM gene expression, DNA replication and Retinoblastoma gene 

Model showing RBR3 role in the RBR/E2F pathway controlling the expression of MCM2-7 
genes, DNA replication, and cell transformation. RepA inhibits RBR1; thus, stimulating the 
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pathway leading to S-phase gene expression, DNA synthesis, and cell transformation 
through up-regulation of RBR3. The transgenic approaches to down- or up-regulate RBR3 
are indicated in italics. The dotted line illustrates a potential inhibitory effect of RepA on 
RBR3 ruled out by Sabelli et al work [42]. 


Fig. 3. 
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5. Expression of MCM protein family as a biological marker of proliferation in 
various diseases 

Genome of the MCM family is necessary for DNA replication and its role has been studied 
in various diseases and cancers. After the conduction of aforementioned basic studies, it was 
quickly revealed that this family of proteins not only can be considered as a cell proliferation 
marker but also can out-power previous classic factors of proliferation such as Ki67 because 
MCM expression in all phases of cell cycle. 

Parvaresh and colleagues (7) through cytometer analysis showed that number of cells 
expressing MCM6 in the proliferation phase was higher than those expressing Ki67 which 
was due to the expression of MCM6 at early G1 phase, a phase of cell cycle which does not 
express Ki67 antigen. This study suggested that MCM6 may be a unique marker of cell 
cycle and might be employed as a novel prognostic marker for management of cancers. 

The following is the summary of studies on different members of this family: 


6. Clinicopathologic studies on expression of MCM family proteins as 
proliferative markers 

6.1 Expression of MCM family proteins in non neoplastic diseases 

DNA synthesis disorders and DNA damage response can also be important in pathogenesis 
of many non-neoplastic diseases. Since MCM family proteins play a major role in initiation 
of DNA synthesis and DNA damage response, evaluation of MCM subunits can be effective 
in recognizing the cause of various non neoplastic diseases. 

Cortez and colleagues showed that 2 MCM subunits namely MCM2-3 and MCM7 can be 
used as a check point for S phase considering their correlation with Ataxia-telangiectasia 
mutated (ATM) and ATM- and Rad3-related (ATR) and ATR-interacting protein (ATRIP)- 
interacting subunit (43). 
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Evaluation of these factors has also helped in pathogenesis of Diabetes and some viral 
diseases. 

Willcox and coworkers (45) demonstrated that in type I diabetes Alpha and Beta cells 
undergo an increase in proliferation during progression. These cells show a high level of co¬ 
expression of Ki67 and MCM which are indicative of a proliferative response in an 
autoimmune attack during the course of diabetes type I. 

Qian and colleagues (44) showed that MCM complex can be effective in understanding the 
pathogenesis of many viral diseases. Targeting MCM complex is one mechanism pUL117 
employs to help block cellular DNA synthesis during HCMV infection. Their finding 
substantiates an emerging picture that deregulation of MCM is a conserved strategy for 
many viruses to prevent host DNA synthesis and helps to elucidate the complex strategy 
used by a large DNA virus to moderate cellular processes to promote infection and 
pathogenesis. 

6.2 Role of MCM family proteins in neoplastic lesions 

Since classically proliferative biomarkers like Ki67 and proliferating cell nuclear antigen 
(PCNA) are known as the indices of proliferation phase, they are extensively used as 
diagnostic biomarkers in many types of cancers. 

Recently, MCM family proteins are a group of proteins that has been described in DNA 
replication in both benign and malignant tumors. As MCM proteins are only recognizable in 
cells which are in the cell cycle, therefore, it seems that they could be a better indicator of 
proliferative cells, cancer cells or malignant tissues compared to conventional biomarkers. 
Numerous studies has been suggested that their expression in some of the preneoplastic 
lesions and malignancies is often associated with a higher degree of cell atypia and poor 
prognosis. 

Up to our knowledge, expression of MCM family proteins has been extensively studied in 
neoplastic disorders including skin tumors, meningioma, non-small cell lung cancer, 
Hodgkin's lymphoma [47], prostate cancer, oral tongue squamous cell carcinoma [48], 
chondrosarcoma, oligodendroglial tumors, esophageal neoplasm, renal cell carcinoma, 
colonic cancer, breast cancer, endometrial carcinoma, thyroid carcinoma, gastric 
adenocarcinoma, merckle cell carcinoma, cervical carcinoma and bladder carcinoma. A 
summary of these studies is as follows: 

Among skin tumors, squamous cell carcinoma, Bowen disease, basal cell carcinoma, 
malignant melanoma, and nevus have been studied (46). Also, Shin et al. reported a 
significant positive correlation between MCM2 immunoactivity and grade of actinic 
keratosis. They declared MCM2 as a reliable marker for diagnosis and grading and 
suggested further investigation on its prognostic value. 

Shahjahan and associates [49] studied ProEx C, a biomarker reagent containing 
antibodies to minichromosome maintenance protein 2 (MCM2) and topoisomerase II A 
(TOP2A) used to detect aberrant S-phase induction in cells. The authors studied 289 
non-small cell lung cancers using immunohistochemistry and found ProEx C 
expression in more than two-thirds of the cancers and an association between strong 
expression and a longer 5-year survival in certain cellular subtypes. The findings 
suggested a role in tumor progression of these cancer cells and might be a potential 
basis for targeted therapy. 
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Histomorphology and immunohistochemistry studies also showed increased 
expression of MCM2 in areas of malignant transformation in recurrent pleomorphic 
adenoma (50). 

Nuclear expression of MCM2 has been demonstrated in a large number of breast cancer 
patients. Its expression in dysplastic, malignant and cancer cells can be predictive of 
potential malignancy and can help in determining the grade of breast cancer (51, 52). 
Expression of MCM3 has been evaluated in astrocytic tumors and cervical carcinoma 
(53). 

High expression of MCM4 has been reported in meningioma and cervical carcinoma 
(52, 54). 

Also, MCM4 may play an essential role in the proliferation of some NSCLC cells. Taken 
together with higher expression in NSCLCs and its correlation with clinicopathologic 
characteristics such as non-adenocarcinoma histology, MCM4 may have potential as a 
therapeutic target in certain population with NSCLCs [55, 56]. 

Increased expression of MCM4 might be associated with pathological staging of 
esophageal cancer [57], 

MCM5 expression has been shown in hepatitis induced carcinogenesis (58), 
adenocarcinoma of the stomach (59), and meningioma (60). Co-expression of MCM2 
and MCM5 as a marker of proliferation and differentiation has been evaluated in colon 
cancer. High expression of these two in mild and moderate cutaneous dysplasia in 
proliferative lesions of verrucous leukoplakia can help in studying the prognosis of 
their malignant transformation. Despite the expression of MCM4 and MCM5, increased 
expression of MCM6 and MCM7 has also been studied in meningioma (54). 

A study showed that expression of MCM7 in esophageal squamous cell carcinoma was 
associated with a more invasive nature (61). 

Aberrant over-expression of proteins called minichromosome maintenance (MCM) 
proteins at the mucosal surface of dysplastic esophageal squamous epithelium and 
Barrett's mucosa may indicate proliferation potential. [62] 

MCM7 detected more cells in the cycle than Ki67 and PCNA and all cases of SC 
glioblastoma, the most aggressive subset, displayed a significant increase of MCM7- 
stained nuclei versus those stained with Ki67. [63] These studies implicate MCM7, and 
the DNA replication licensing gene family, in prostate cancer progression, growth and 
invasion. [64] MCM-7 also has been studied in gestational trophoblastic disease [65] and 
metastatic colon carcinoma [66]. 

In previous studies such as in Fujioka et al [67] study they demonstrated that higher 
levels of MCM 7 expression were correlated with poor differentiation of tumors, non- 
bronchioloalveolar carcinomas of lung, large tumor size and poor prognosis. Li et al 
[68] also showed that MCM 7 expression was significantly correlated with poor 
histologic grade, old age, and poor survival in cases of endometrial carcinoma. 
Padmanabhan et al [69] revealed that MCM 7 was associated with tumor stage and 
perineural invasion in prostatic intraepithelial neoplasia and invasive adenocarcinoma. 

6.3 The role of MCM protein family in cancer treatment 

Because of MCM family proteins' vital role in genome duplication in proliferating cells, 
deregulation of the MCM function results in chromosomal defects that may contribute to 
tumorigenesis. As we already reviewed, the MCM proteins are highly expressed in 
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malignant human cancers cell and pre-cancerous cells undergoing malignant 
transformation. They are not expressed in differentiated somatic cells that have been 
withdrawn from the cell cycle. Therefore, these proteins are ideal diagnostic markers for 
cancer and promising targets for anti-cancer drug development. [70) 

In this respect, medications targeting some members of the MCM family are considered 
novel anticancer drugs. 

Two studies evaluated the role of medications in management of the tumor in prostate 
cancer patients by measuring the expression of MCMs. In one of these studies due to the 
high level of MCM expression in these lesions Genistein and Trichostatin (TSA) were 
administered resulting in down-regulation of all MCM genes and subsequently decreasing 
the S phase in tumoral cells of the prostate cancer (54). 

Iljin et al, (71) in their study indicated that three novel cancer selective growth inhibitory 
compounds can result in decreased DNA synthesis. This reduction can be evaluated via 
MCM expression. 

7. Conclusion 

Members of the MCM family play a key role as the initiator of DNA replication working as 
DNA helicase. They are also involved in the process of transcription, cohesion, 
condensation, and recombination in both the nucleus and the cytoplasm. These markers 
have been extensively evaluated in basic and clinical studies. Aforementioned clinical 
studies showed the expression of these proteins specially MCM 2, 3,4,5,6,7 specially in 
preneoplastic and cancers and also in some viral and endocrine diseases eg Diabetes . They 
have been suggested as standard diagnostic and prognostic biomarkers in some tumoral 
lesions. 

Many of these proteins can be employed as a target for anti-cancer medications currently 
present in the market or those under development. 

Further studies on various members of this family in all the pathologic diseases specially 
precancerous lesions and malignant processes can illuminate their pathogenesis and 
biologic behavior . In tumoral lesions, these markers can be easily evaluated through 
immunohistochemistry. Therefore, it is recommended that research projects focus on 
studying not only one of them but evaluation co expression of some of the various members 
of this family in tumoral, pre-neoplastic and neoplastic lesions in all organs.In this way, 
these proliferative markers can gradually substitute the standard proliferative index 
markers like Ki67 which was the main objective of the present review. 
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1. Introduction 

Replication of the DNA during the synthesis (S) phase of the cell cycle is one of the most 
critical aspects of cell division. DNA replication must be highly accurate and tightly 
controlled to maintain genomic integrity over many rounds of cell division. This is 
particularly important during animal development, since genetic instability can lead to cell 
death, birth defects, developmental abnormalities and diseases such as cancer. The 
developmental context also adds specific constraints to S-phase regulation. For instance, 
variations in DNA replication control are needed to accommodate the rapid embryonic 
divisions in early embryos, the production of haploid germ cells, and the generation of 
polyploid tissues. A comprehensive understanding of DNA replication requires insight in 
these developmental aspects of S phase control. Here, we review the initiation of DNA 
replication in the genetic animal model Caenorhabditis elegans (C. elegans), with a focus on 
developmental-stage and tissue-specific regulation. 

2. Caenorhabditis elegans 

Caenorhabditis elegans (C. elegans) was introduced as a model organism in the 1960s by 
Sydney Brenner and became, in a relative short time, one of the leading model organisms 
in biological research (Ankeny 2001). One of the appealing aspects of this nematode is its 
rapid and reproducible development from the one-cell embryo to the adult stage (Sulston 
& Horvitz 1977). The invariance, combined with the fact that the animals are transparent 
and contain a relatively low number of cells (adult hermaphrodites contain only 959 
somatic cell nuclei), has made it possible to record the entire somatic cell lineage of C. 
elegans (Horvitz & Sulston 1980; Sulston & Horvitz 1977; Sulston & Horvitz 1981). 
Knowing when each cell normally divides is a major benefit for studies of the cell cycle. 
Efficient genetics has allowed identification of mutations that alter the normal cell lineage 
(tin mutants), some of which affect DNA replication or DNA content (Horvitz & Sulston 
1980; Sulston & Horvitz 1981). As an additional advantage, many cell cycle regulators that 
exist in gene families in higher eukaryotes are represented by single genes in C. elegans, 
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which helps identification of gene function and determination of the hierarchy of gene 
functions in regulatory pathways. 

While these aspects make C. elegans suitable for cell cycle studies, there are additional 
reasons for adding this animal to the repertoire of cell cycle models. Studies of DNA 
replication in the context of a developing organism may identify regulatory mechanisms 
that are not important for single cell eukaryotes and cells in tissue culture. The 
developmental context adds an extra layer of S-phase regulation. For instance, in meiosis, 
two rounds of chromosome segregation follow each other without intervening S phase, 
while in endoreplication cycles, rounds of DNA replication continue in the absence of M 
phases. In addition, a broad range of models also increases the potential for uncovering 
important aspects of DNA replication control. For example, studies in C. elegans identified 
a CUL-4/DDB-1 E3 ubiquitin ligase complex as an important inhibitor of DNA re¬ 
replication, which is functionally conserved in mammals (Arias & Walter 2007; Kim & 
Kipreos 2007a; Zhong, et al. 2003). In addition, defects in DNA synthesis were found to 
cause lineage-specific delays in cell division in C. elegans, through a checkpoint 
mechanism that also contributes to the difference in timing of founder cell division in the 
early embryo (Brauchle, et al. 2003; Encalada, et al. 2000). Furthermore, our recent results 
support tissue specific contributions of a conserved general regulator of DNA replication, 
MCM-4 (Korzelius, et al. 2011). Below, we describe the currently known factors that 
control DNA replication in C. elegans, as well as their functions in particular stages of 
development and specific cell types. Several techniques used for analysis of DNA 
replication in C. elegans are summarized in BOX 1. 

3. The factors that regulate DNA replication 

The regulation of DNA replication in eukaryotes involves two discrete steps. First, pre¬ 
replication complexes assemble at sites of replication initiation ("origin licensing"), and 
subsequently, the actual initiation of DNA synthesis can be triggered ("origin firing"). 
Comprehensive studies aimed at identifying all components involved in DNA replication 
have not been reported for C. elegans. However, functional annotations by the C. elegans 
genome sequence consortium have revealed orthologs of many DNA replication 
components (www.wormbase.org). In addition, some DNA replication genes have been 
identified through mutations, and genome-wide RNA interference (RNAi) has confirmed 
that most putative replication components exert critical functions (Encalada, et al. 2000; 
Korzelius, et al. 2011; Sonnichsen, et al. 2005). Despite their clear conservation, certain 
well-known replication genes currently appear to lack C. elegans counterparts (see Table 
1). For instance, in eukaryotes ranging from yeast to human, the origin recognition 
complex (ORC) has been found to consists of 6 subunits, ORC1 to ORC6. At present, ORC- 
2 is the only ORC protein identified in C. elegans, and its function has not been 
characterized in detail. 

Recruitment of the ORC is normally the first step in pre-replication complex assembly, 
which is followed by association of the CDC6 and CDT1 proteins. C. elegans does contain 
legitimate CDC-6 and CDT-1 orthologs, which are essential for DNA replication and 
required for embryonic as well as larval viability (Kim, et al. 2007; Kim & Kipreos 2008; Kim 
& Kipreos 2007a; Kim & Kipreos 2007b). Simultaneous overactivation of CDC-6 and CDT-1 
leads to extensive re-replication, which underscores the role of CDC-6 and CDT-1 as critical 
regulators of origin licensing. 
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BOX1: C. elegans DNA replication analysis 

One of the advantages of the use of C. elegans as a model system is that the animal is 
fully transparent, which allows the use of Differential Interference Contrast (DIC, 
also known as Nomarski) microscopy for live observations of cell division. 
Moreover, expression and localization of the green fluorescent protein (GFP) and 
other fluorophores can be followed by time-lapse microscopy. Introduction of 
transgenes with tissue or cell type-specific promoters that drive expression of GFP 
or GFP-tagged fusion proteins is a routine procedure in C. elegans (Mello & Fire 
1995). However, transgenes are usually silenced in the germline and in early 
embryos, which can be avoided by integrating a single copy transgene through 
DNA particle bombardment or the MosSCI technique (Frokjaer-Jensen, et al. 2008; 
Praitis, et al. 2001). We have recently applied the MosSCI strategy for integration of 
a single copy transgene expressing an MCM-4::mCherry protein fusion, which 
rescues mcm-4 null mutants and shows a similar expression pattern and subcellular 
localizations as the endogenous MCM-4 protein (Korzelius, et al. 2011, and our 
unpublished results). 

In addition to gene expression studies, DNA replication itself can be visualized in 
multiple different ways. The most quantitative method makes use of determination 
of the DNA content. The DNA content of a cell correlates with the cell cycle phase: 
cells in G1 have a ploidy of 2n; S phase cells between 2n and 4n; and cells in the G2 
and M phases 4n. To measure the DNA content, animals are fixed and stained with 
a dye that fluoresces when bound to DNA, such as propidium iodide, Hoechst 
33258, or DAPI (4'6'- diamidino-2-phenylindole dihydrochloride). The most 
accurate method, but also the most time consuming, for in situ quantification is 
analysis of the fluorescence signal in confocal serial sections of propidium iodide- 
stained nuclei (Boxem, et al. 1999; Feng, et al. 1999; Zhong, et al. 2003). The accuracy 
of this method makes it ideal for experiments in which small differences in DNA 
content must be distinguished, e.g. when comparing cells in G1 vs. S phase. 

In order to investigate if cells go through the process of DNA replication, or 
whether DNA replication takes place at specific times of development, 
incorporation of the thymidine analogues 5-bromo-2'-deoxyuridine (BrdU) or 5- 
ethynyl-2'-deoxyuridine (EdU) can be used. BrdU incorporation can be detected by 
immunostaining with specific anti-BrdU antibodies. EdU detection is based on a 
copper (Cu 1+ ) catalyzed covalent "click" reaction between an azide attached to a 
fluorescent dye and the alkyne group of EdU (Salic & Mitchison 2008). While BrdU 
detection in C. elegans has been possible for some time (Boxem, et al. 1999), the EdU 
method is new and has been applied only in a few recent studies (Fig.l) (Cinquin, et 
al. 2010; Korzelius, et al. 2011). The EdU method has a major advantage over BrdU 
staining: while BrdU detection requires DNA denaturation, this step is not needed 
in the EdU procedure. As a result, EdU incorporation can be combined with 
immunostaining with antibodies, which can be a great help in visualizing cells of 
interest. 
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Flow cytometry is commonly used for DNA quantification in other systems. 
Although this technique is not widespread, flow cytometry has been used to 
produce accurate measurements of DNA content for freshly dissociated C. elegans 
cells (Bennett, et al. 2003). The dissociated C. elegans cells represented multiple cell 
types, which reduces the utility of the DNA distribution information. This 
limitation can be avoided by using strains in which cells of interest are marked with 
transgenes that express GFP (or other fluorescent tags). GFP expression can be used 
to gate cells of interest in the flow cytometry analysis so that the DNA distribution 
of only the GFP expressing cells is analyzed. In future studies, this coupling of 
selective GFP expression with propidium iodide staining will probably be applied 
more broadly in the analysis of the DNA distribution of specific tissues and cells of 
interest. 
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Fig. 1. EdU incorporation and staining visualizes DNA replication in C. elegans 
larvae. EdU incorporation in cells of the ventral nerve cord in a first stage larva (A, 
B, and C) and nuclei in the intestine of an early L4 larva (A', B', and C') are 
indicated by arrows. Panels show DNA staining by DAPI (A and A'), EdU staining 
(B and B') and merged images (C and C'). Note that cells that completed S phase 
prior to EdU addition stain with DAPI but do not incorporate EdU, such as the 
neurons indicated by arrowheads. One arm of the developing gonad is visible at the 
right (A', B', C'). 
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Studies in other systems have shown that CDC-6 and CDT-1 are needed to load the 
minichromosome maintenance (MCM) protein complex onto the replication origins. The 
MCM complex consists of 6 proteins, MCM2 to MCM7, which is thought to act as the helicase 
that unwinds the DNA at the replication origins. C. elegans contains orthologs of all six MCM 
genes, which are known as mcm-2 to man-7 and cause similar embryonic lethal phenotypes 
when inactivated by RNAi (Sonnichsen, et al. 2005). MCM-4 was initially identified through a 
mutation in the lin-6 gene, and is the only C. elegans MCM protein studied in detail (Korzelius, 
et al. 2011). MCM-4 is expressed in all dividing cells during embryonic and postembryonic 
development. It is strongly induced just prior to the Gl/S transition in somatic cells and 
disappears when cells exit the cell cycle. MCM-4 localizes to the cell nucleus in interphase, 
while in mitosis MCM-4 localization becomes diffuse throughout the cell upon nuclear 
envelope breakdown. In late anaphase, MCM-4 starts to colocalize with the DNA, presumably 
licensing the DNA for the next round of S-phase (Fig. 2). 


Protein 

Name 

S. cerevisiae 

S. pombe 

Drosophila 

melanogasier 

Mammals 

C. elegans 

Prereplication complex 

Orcl-6 

Orcl-6 

Orcl-6 

Orcl-6 

Orcl-6 

ORC-2 

Cdc6 

Cdc6 

Cdcl8 

Cdc6 

Cdc6 

CDC-6 

Cdtl 

Cdtl/Tahll/ 

Sid2 

Cdtl 

Dup 

Cdtl 

CDT-1 

Me m2 

Mcm2 

Mcm2/Cdcl9/Ndal 

Mcm2 

Mcm2 

MCM-2 

Me m3 

Mcm3 

Mcm3 

Mcm3 

Mcm3 

MCM-3 

Mcm4 

Cdc54/ 

Mcm4 

Cdc21 

Dpa 

Mcm4 

MCM-4 

Mcm5 

Cdc46/ 

Mcm5 

Mcm5/Nda4 

Mcm5 

Mcm5 

MCM-5 

Mcm6 

Mcm6 

Mcm6/Mis5 

Mcm6 

Mcm6 

MCM-6 

Mcm7 

Cdc47 / 

Mcm7 

Mcm7 

Mcm7 

Mcm7 

MCM-7 

Preinitiation complex 

McmlO 

Mcml0/Dna43 

Cdc23 

McmlO 

McmlO 

Y47D3a.28 

Cdc45 

Cdc45/Sld4 

Sna41/Cdc45 

Cdc45 

Cdc45 

F34D10.2 

Sld3 

Sld3 

Sld3 

- 

- 

- 

Dbpll 

Dbpll 

Cut5/Rad4 

MuslOl 

TopBPl 

MUS-101 

Sld2 

Sld2/Drcl 

Drcl 

- 

- 

- 

Sld5 

Sld5 

Sld5 

Sld5 

Sld5 

Y113G7B.24 

Psfl 

Psfl 

Psfl 

Psfl 

Psfl 

R53.6* 

Psf2 

Psf2 

Psf2 

Psf2 

Psf2 

F31C3.5* 

Psf3 

Psf3 

Psf3 

Psf3 

Psf3 

- 

Kinases 

Cdc7 

Cdc7 

Hskl 

Cdc7 

Cdc7 

C34G6.5 

Dbf4 

Dbf4 

Dfpl 

Chiffon 

Dbf 4 / Ask/ Drf 1 

- 


Table 1. Homologues of DNA replication components. *Based on homology searches only. 
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The absence of DNA replication, as observed in mcm-4 mutants, might be expected to trigger 
a checkpoint that delays mitotic entry. However, mcm-4 mutants enter mitosis in the absence 
of DNA replication and, initially, with normal timing, suggesting that mcm-4 is not only 
required for DNA replication but also activates a checkpoint that monitors completion of 
DNA replication (Korzelius, et al. 2011). This second function corresponds to the results 
obtained in studies with other organisms, which clarified the requirement of the MCM 
complex in activation of the DNA damage and replication checkpoints (Labib, et al. 2001; 
Zou & Elledge 2003). In addition to these well conserved functions, mcm-4 also displays a 
tissue-specific requirement in C. elegans, which will be discussed below (Korzelius, et al. 
2011 ). 



Fig. 2. Time-lapse fluorescence microscopy shows expression and localization of MCM-4 in 
an early embryo. MCM-4 is fused to mCherry and expressed from the mcm-4 promoter (A- 
E). Merged images of the DIC and fluorescence channels are shown in the bottom panels 
(A'-E'). The red MCM-4: :mCherry fluorescence is visible in the anterior AB and posterior PI 
cell in the two stage embryo (A and A'). Note that the AB cell enters mitosis before the PI 
cell (B and B'). MCM-4 can be detected on the chromosomes in late anaphase (arrowhead in 
PI cell, D and D"). 

Activation of the MCM2-7 complex is needed for opening the DNA helix and allowing the 
DNA polymerases to start DNA replication. This activation marks the end of origin 
licensing and the start of origin firing (Labib & Diffley 2001). Studies in several organisms 
have shown that the onset of S-phase requires CDK (cyclin dependent kinase) and DDK 
(Dbf-4 dependent Cdc7 kinase) activity to promote activation of the MCM2-7 helicase, while 
at the same time the recruitment of pre-replication complexes is inhibited (Bousset & Diffley 
1998; Nguyen, et al. 2001; Remus, et al. 2005). CDKs and DDK4 are not only required for the 
activation of the MCM complex, they also trigger the assembly of additional factors. This 
results in the formation of a "preinitiation complex" that contains a large and still growing 
group of proteins, such as Cdc45, McmlO, RPA and the DNA polymerases a and s (Bell & 
Dutta 2002; McGarry & Kirschner 1998; Mechali 2010; van Leuken, et al. 2008). Most of these 
factors have not been identified or investigated in C. elegans, and the formation and function 
of the preinitiation complex in C. elegans therefore remains elusive (Table 1). In animal 
systems, Geminin acts as an inhibitor of CDT-1, which is degraded in mitosis in an APC/C- 
dependent fashion (McGarry & Kirschner 1998; van Leuken, et al. 2008). C. elegans Geminin 
GMN-1 also associates with CDT-1 and inhibits origin licensing when added to frog egg 












Regulation of DNA Synthesis and Replication 
Checkpoint Activation During C. elegans Development 


21 


extracts (Yanagi, et al. 2005). GMN-1 inhibition results in germline defects and intestinal 
abnormalities with chromatin bridges. Thus, Geminin may be an example of a metazoan- 
specific regulator of DNA replication initiation. 

4. Preventing re-replication 

When DNA replication is initiated, origin licensing should be prevented, as re-firing of only 
a single origin may lead to gene amplification and could have dramatic consequences. 
Hence, all eukaryotes use multiple levels of control to prevent more than one round of DNA 
synthesis within a single S-phase, although the exact players and mechanisms differ 
somewhat between species. In general, there are two mechanisms used to prevent re¬ 
replication: firstly, formation of the pre-replication complex (prior to S-phase) and the 
activation of the origins (during S phase) are temporally separated, and secondly, proteins 
required for the formation of the pre-replication complex are inactivated as soon as DNA 
replication starts (Arias & Walter 2007; Blow & Dutta 2005; Machida, et al. 2005). 
Surprisingly, despite the importance of a single round of DNA replication and the 
redundant levels of control, certain single gene mutations cause substantial re-replication. 
As an important example, C. elegans cul-4 displays such a re-replication phenotype (Zhong, 
etal. 2003). 

cul-4 encodes the core subunit of a cullin based E3 ubiquitin ligase that targets substrate 
proteins for ubiquitylation and degradation. Kipreos and coworkers studied the effects of 
cul-4 inhibition by RNAi in the epithelial stem-cell like "seam" cells in the C. elegans skin. 
Interestingly, they observed that cul-4 RNAi resulted in seam cells with up to a lOOn DNA 
content and showed that this results from extensive re-replication rather than failed mitosis 
(Zhong, et al. 2003). As mentioned above, a key mechanism of preventing re-replication is 
inactivation of the components that form the pre-replication complex. Indeed, it was shown 
that cul-4 is required for the degradation of one of these components. When cul-4 is 
inhibited, CDT-1 levels do not drop at the end of G1 but remain constant throughout S- 
phase, indicating that CUL-4 is required for S-phase degradation of CDT-1. Subsequent 
studies in C. elegans and other systems demonstrated that CUL-4 in association with the 
DNA damage binding protein 1 (DDB-1) recognizes CDT-1 as a substrate (Arias & Walter 
2007; Blow & Dutta 2005; Kim & Kipreos 2007a; Kim & Kipreos 2007b). However, 
degradation of CDT-1 by CUL-4 is not the whole story, since expression of stable CDT-1 
alone does not cause noticeable re-replication. CUL-4 was also found to be responsible for 
the localization of CDC-6, another member of the pre-replication complex. CDC-6 normally 
accumulates in the nucleus during G1 phase, and is exported from the nucleus to the 
cytoplasm during S-phase. The activity of CUL-4 turned out to be needed for nuclear export 
of CDC-6. Thus, CUL-4 inactivation deregulates two essential factors of the pre-replication 
complex. High nuclear levels of both CDT-1 and CDC-6 in S-phase allow continued origin 
licensing and promote re-replication (Kim, et al. 2007; Kim & Kipreos 2007a). 

Although intriguing, the mechanism by which CUL-4 regulates nuclear export of CDC-6 in 
S-phase was not immediately apparent. However, two clues were available: CDC6 nuclear 
export is regulated by Cyclin-CDKs in other systems, and, similar to the human homolog, 
the amino terminus of C. elegans CDC-6 contains multiple nuclear localization signals 
flanked by potential CDK phosphorylation sites (Kim, et al. 2007; Kim & Kipreos 2007b; 
Kim, et al. 2008). Phosphorylation at these sites coincides with nuclear export, as 
demonstrated by phosphospecific-antibody staining, and mutation of all six CDK sites 
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prevented nuclear export. Thus, CUL-4 could promote nuclear export by stimulating CDK 
phosphorylation of the CDC-6 N-terminus. This is likely accomplished by degradation of a 
CDK inhibitor of the Cip/Kip family, known as CKI-1 in worms, Dacapo in flies and p21 Ci P 1 
in vertebrates (Fig. 3) (Bondar, et al. 2006; Higa, et al. 2006; Kim & Kipreos 2007a; Kim & 
Kipreos 2007b; Kim & Kipreos 2007b; Kim, et al. 2008; Korzelius, et al. 2011). 



Fig. 3. Preventing re-replication. Inactivation of CDT-1 and CDC-6 in S phase provides a key 
mechanism for preventing re-replication. The cullin RING ubiquitin E3 ligase (CRL) 
complex CRL4 cdl2 is critical in the inactivation of CDT-1 as well as CDC-6. CRL4 cdt2 contains 
the cullin protein CUL-4, adaptor DDB-1 and substrate recognition unit CDT-2. This 
complex recognizes its substrates in association with PCNA. CDT-1 and a CDK inhibitor of 
the Cip/Kip family, CKI-1, contain a PCNA interacting protein (PIP) motif in the N- 
terminus and are degraded by CRL4 cdt2 . As PCNA is an auxiliary factor of DNA 
polymerases, the degradation of CDT-1 and CKI-1 can be coupled to DNA replication. 
Inactivation of CKI-1 allows activation of S phase CDK/Cyclin kinases. CDK 
phosphorylation of the CDC-6 N-terminus promotes nuclear export of CDC-6. Because of its 
control of two critical pre-replication complex components, CUL-4 inactivation leads to 
extensive re-replication in C. elegans (see text for further details). 
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In each of these models, a cullin RING ubiquitin E3 ligase (CRL) has been identified that 
contains CUL-4, DDB-1 and a substrate recognition unit CDT-2. This CRL4 cdt2 complex 
recognizes its substrates in an unusual manner. CKI-1, p21 Ci P 1 and CDT-1 all contain a 
PCNA interacting protein (PIP) motif in the N-terminus (Havens & Walter 2009). PCNA is 
an auxiliary factor of DNA polymerases, which forms a ring around the DNA and acts as a 
sliding clamp. Because interaction with PCNA is a prerequisite for CRL4 cdt2 substrate 
ubiquitylation, degradation of the CKI and CDT-1 substrates is coupled to DNA replication. 
In summary, upon association with PCNA, the CDK-inhibitor CKI-1 is recognized by 
CRL4 cdt2 and targeted for degradation. This allows S-phase Cyclin-CDKs to phosphorylate 
CDC-6, which triggers CDC-6 export from the nucleus. In addition to CKI-1, CRL4 cdl2 also 
targets PCNA-bound CDT-1 for ubiquitin-dependent proteolysis. In C. elegans, CDC-6 
nuclear export and CDT-1 degradation are two redundant mechanisms that prevent re¬ 
replication (Fig. 3) (Kim & Kipreos 2007b; Korzelius & van den Heuvel 2007). Because C. 
elegans does not show redundancy for the CRL4 cdt2 E3 ligase in CDT-1 degradation, the 
function of this complex has been more obvious in C. elegans. 

5. Activation of the DNA replication checkpoint in early embryos 

Incomplete DNA replication activates an S-phase checkpoint, which delays progression 
through the cell cycle to create time for repair (Branzei & Foiani 2010). Central in this 
checkpoint is the ATR-Chkl protein kinase pathway, which is activated by lesions created 
by stalled replication forks. Active Chkl phosphorylates downstream cell cycle regulators 
such as the CDC25 phosphatase that controls the activity of CDK1. This S-phase checkpoint 
is generally not functional in early embryos. For example, inhibition of DNA replication 
with a low concentration of hydroxyurea (HU) does not affect cell cycle progression in 
embryos of Drosophila, Xenopus or Zebrafish (Hartwell & Weinert 1989). However, the 
situation is quite different in early C. elegans embryos, which not only contain an active S- 
phase checkpoint, but also activate the ATR-l/Chk-1 pathway as part of normal 
development (Brauchle, et al. 2003; Encalada, et al. 2000). 

The first division of the C. elegans zygote is unequal and generates a larger anterior 
blastomere, AB, and smaller posterior blastomere, PI. These cells give rise to different 
daughter cell lineages. For instance, PI continues an additional three asymmetric divisions 
to produce the germline precursor P4 (Sulston, et al. 1983). In addition to the different fates, 
cell division in the AB and PI lineages also occurs with a different timing, with the AB cell 
dividing approximately 2 minutes earlier than the PI cell (visible in Fig. 2). Interestingly, atl- 
1 ATR and chk-1 function contributes to this asynchrony of cell division in normal embryos 
(Brauchle, et al. 2003). Double inactivation of atl-1 and chk-1 reduced the time between 
mitotic entry (nuclear envelope breakdown) of AB and PI from 125 sec in the wild-type to 
75 sec after atl-1/chk-1 RNAi. Thus, somehow the PI blastomere might preferentially and 
highly reproducibly activate the S phase checkpoint. Asymmetric division of the zygote is 
needed for this distinction between AB and PI (Brauchle, et al. 2003). 

Preferred checkpoint activation in PI is also visible in mutants with defects in DNA 
replication, or embryos treated with HU, which inhibits ribonucleotide reductase (Brauchle, 
et al. 2003; Encalada, et al. 2000; Encalada, et al. 2005; Korzelius, et al. 2011). Both the zygote 
(P0) and PI daughter are able to delay mitosis by about 12 minutes when replication is 
compromised, while the AB daughter halts for only a few minutes. Inactivation of atl-1 
and/or chk-1 prevents these delays, indicating that this is a legitimate, though limited, S- 
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phase checkpoint response. The different response of the PI versus AB lineage has been 
interpreted as protection of the germline against replication errors. Surprisingly, however, 
the checkpoint response to DNA damage (rather than replication arrest) appears actively 
repressed in the PI lineage (Holway, et al. 2006). Bypassing the checkpoint could serve to 
maintain the relative timing of blastomere divisions, which is an essential part of 
development. 

6. The MCM helicase is needed for activation of the replication checkpoint 

Defects in some replication components trigger a checkpoint arrest, while others do not. For 
instance, partial loss of function of div-1, which encodes a DNA polymerase a-subunit, gives 
rise to substantial cell cycle delays (Encalada, et al. 2000). The same is true for inhibition of 
ribonucleotide reductase by HU treatment or rnr-1 RNAi (Brauchle, et al. 2003). However, 
mcm-4 inactivation interferes with DNA synthesis without the induction of a checkpoint 
response (Korzelius, et al. 2011). Cells in mcm-4(RNAi) embryos and mcm-4 mutant larvae 
enter mitosis at the appropriate time and continue chromosome segregation as well as cell 
division. Moreover, RNAi of mcm-4 suppressed the checkpoint delay induced by mr-1 
inhibition. These data indicate that MCM-4 is not only required for DNA replication but also 
for activation of the S phase checkpoint. Genome fragmentation has also been reported for 
cdt-l(RNAi) and cdc-6(RNAi) embryos. Thus, the assembly of a pre-replication complex 
appears to be needed to trigger the S-phase checkpoint. 

Studies in other organisms support these observations and have demonstrated that 
activation of the DNA damage and replication checkpoints requires MCM helicase activity. 
Recruitment of Replication Protein A (RPA) to single-stranded DNA is probably the actual 
checkpoint trigger (Zou & Elledge 2003). The helicase activity of MCM proteins generates 
ssDNA, through unwinding the DNA at the replication fork. Stalling of replication forks, 
e.g. after HU treatment, causes uncoupling of the MCM helicase from DNA polymerase 
activity (Byun, et al. 2005). Consequently, fork stalling leads to an accumulation of ssDNA, 
which recruits additional RPA and causes activation of the checkpoint kinases ATR and 
Chkl. The formation of replication forks and the generation of ssDNA both require MCM 
function. This explains why C. elegans mcm-4 loss of function prevents DNA synthesis 
without activation of the replication checkpoint. 

7. Endoreplication: polyploidy required for growth 

Endoreplication cycles bypass mitosis while DNA replication continues, which results in a 
doubling of the ploidy during each endocycle. Endoreplication commonly occurs in specific 
cell types during metazoan development. In C. elegans, only two tissues become polyploid 
as a result of endoreplication: the intestine and the epidermis (formally known as 
hypodermis). Intestinal cells endoreplicate during each larval stage, increasing the ploidy to 
4n at the transition from first to second larval stage and leading to intestinal nuclei with 32n 
DNA in adult animals (Hedgecock & White 1985). 

The situation in the epidermis is more complex. Epidermal nuclei reside in syncytia, sharing 
a common cytoplasm without separating membranes. The largest epidermal syncytium is 
hyp7, which covers most of the body except for regions of the head and tail (Hedgecock & 
White 1985). In each larval stage, stem-cell like precursors in the epidermis, known as 
"seam cells", divide to create novel seam cells and daughter cells that fuse with the hyp7 
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syncytium (Sulston & Horvitz 1977). Ultimately, this creates a syncytium with 133 nuclei. 
The newly created epidermal cells duplicate their genomic DNA prior to fusion, so that they 
enter the syncytium as 4n nuclei (Hedgecock & White 1985). Endoreplication has been 
reported to occur in adult stage hyp7 nuclei, although the level varies between nuclei, with 
an average ploidy of lOn to 12n in older adults (Fig. 4) (Flemming, et al. 2000; Morita, et al. 
2002; Nystrom, et al. 2002). 



Fig. 4. DNA endoreplication in the epidermis. A. Propidium iodide staining of a young C. 
elegans adult is shown, arrowhead indicate polyploid nuclei of the epidermis. B. 
Quantification of DNA content based on propidium iodide staining. Nuclei of the body 
wall muscles are used as a reference for 2n DNA content. The epidermal nuclei show 
increased ploidy with up to 8n DNA content. The DNA content of epidermal nuclei further 
increases in concert with growth of late stage adults. Each dot represents a single nucleus. 

Why these two cell types, the skin and intestine, undergo endoreplication is not fully 
understood. It has been speculated that endoreplication is used to maintain the integrity of 
these tissues, while allowing increased genome ploidy to support increases in cell volume 
and metabolic activity (Kipreos 2005). In many organisms however, endoreplication has 
been correlated with growth. Indeed, the C. elegans epidermis and intestine grow extensively 
during larval development, and endoreplication in the epidermis has been correlated with 
the size of the entire animal. 

Several observations support this conclusion. Hydroxyurea (HU) treatment of adult 
animals, in which somatic cell proliferation has been completed, prevents endoreplication in 
the epidermis as well as growth of the animals. (Lozano, et al. 2006). In contrast, tetraploid 
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animals are 40% larger in volume than wild type worms, which closely corresponds to the 
increase in epidermal polyploidy (from average 11.2n to 16.7n, in adults at 148 hrs.). 
Furthermore, the first generation homozygous cye-1 cyclin E mutants survive till adulthood 
because of maternal CYE-1 supplies and these animals show reduced endoreplication in the 
epidermis and a corresponding reduction in body size (Lozano, et al. 2006). Finally, mctn-4 
mutants fail DNA replication and are severely growth retarded and larval lethal. Specific 
expression of MCM-4 in the epidermis of such mutants is sufficient to rescue larval growth 
and lethality (Korzelius, et al. 2011). Thus, the polyploidy of the epidermis contributes to the 
body size of the adult animal. 

8. Tissue-specific regulation of DNA replication 

Interestingly, endoreplication in the epidermis is regulated by a TGF-P signal transduction 
pathway. C. elegans uses several different TGF-p pathways to control a variety of 
developmental processes, including growth. Mutations in components of this pathway lead 
to smaller and thinner adult animals (small phenotype: Sma). The ligand for the growth 
pathway is DBL-1, which is homologous to DPP/BMP-4 (Morita, et al. 2002; Suzuki, et al. 
1999). DBL-1 signals through the Type I and II TGF-P serine/threonine kinase receptors 
SMA-6 and DAF-4, respectively, to the downstream SMAD transcriptional regulators SMA- 
2, SMA-3 and SMA-4 (Savage-Dunn 2005). Notably, daf-4 and sma-2 mutants are not only 
small and thin, but also show reduced ploidy of epidermal nuclei (Flemming, et al. 2000; 
Nystrom, et al. 2002). 

A critical downstream target of the DBL-1 pathway has also been identified: lon-1 (Maduzia, 
et al. 2002; Morita, et al. 2002). Homozygous lon-1 mutant animals are longer than normal 
(Lon phenotype), while overexpression of lon-1 leads to a small phenotype. Several 
observations indicate that lon-1 acts downstream of the SMA-6 TGF-p type I receptor: 
double sma-6; lon-1 mutants are still somewhat long, and lon-1 mRNA levels are increased in 
sma-6 mutants. Surprisingly, lon-1 encodes a putative transmembrane protein, related to the 
plant pathogenesis-related protein 1 (PR-1) and human glioma-pathogenesis related protein 
(GliPR-1). This LON-1 protein is expressed and required in the epidermis, and anti-LON-1 
antibodies showed localization to apical junctions (Morita, et al. 2002). LON-1 is claimed to 
repress endoreplication, based on the increased epidermal ploidy in lon-l(el85) and lon- 
l(RNAi) adults. However, two other lon-1 mutations show somewhat reduced ploidy 
compared to wild-type (Morita, et al. 2002). Thus, although further research is needed, there 
is strong evidence that the DBL-1 ligand, produced in a set of neurons, activates a TGF- 
P/SMA pathway, which inhibits lon-1 expression in the epidermis, and thereby allows 
endoreplication and growth of the adults. 

The TGF-P/SMA/LON-1 pathway should somehow connect to the cell cycle in order to 
regulate endoreplication. Based on the Sma phenotype of cye-1 mutants, it has been 
proposed that Cyclin E is the key regulator (Lozano, et al. 2006). Cyclin E mutant mice also 
show defects in trophoblast endoreplication (Parisi, et al. 2003), and the fluctuating activity 
of Cyclin E with its kinase partner CDK-2 drives endoreplication in Drosophila (Claycomb & 
Orr-Weaver 2005; Lilly & Duronio 2005). However, TGF-P/SMA/LON-1 signaling could 
also act more upstream of cye-1, e.g., in the regulation of cyd-1 Cyclin D. cyd-1 mutants are 
also small, and C. elegans Cyclin D is needed for endoreplication, at least in the intestine 
(Boxem & van den Heuvel 2001). At least in C. elegans, Cyclin D is essential for Gl/S 
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progression and induction of S phase genes such as MCM proteins (Boxem & van den 
Heuvel 2001; Boxem & van den Heuvel 2002; Korzelius, et al. 2011). 

The cyd-1 and mcm-4 mutants show an interesting phenotypic difference. Homozygous 
mutants of either cyd-1 or mcm-4 complete embryogenesis, because of maternal supplies, 
fail DNA replication from the first larval stage onward, and show severe growth 
retardation. However, only mcm-4 mutants show larval lethality, which is fully 
suppressed by expression of mcm-4 from an epidermis specific promoter (Korzelius, et al. 
2011). The ability to arrest the cell cycle probably underlies the difference between cyd-1 
and mcm-4 mutants: post-embryonic blast cells in cyd-1 mutants arrest prior to S phase 
entry, while they continue abnormal mitosis in mcm-4 mutants. As a result, the structural 
integrity of the epidermis is lost only in mcm-4 mutants, which often causes larval death. 
Thus, not absence of DNA replication but lack of an S-phase checkpoint response may 
lead to death of the animal. 

9. In conclusion 

Studies of DNA replication in C. elegans have thus far been limited. Given the variation of 
well-established models for replication studies, which include budding and fission yeast, 
Xenopus egg extracts, cells in culture, in vitro systems and even flies, one could question 
the need for studying S phase in the worm. However, several important mechanistic 
insights have been obtained from observations of DNA replication-defective phenotypes 
in the worm. Moreover, such analyses have emphasized the variation in regulatory 
mechanisms between different developmental stages and in different cell types, 
underscoring the need for studies of replication control in a developmental context. The 
combination of its large embryonic cells, strong cell biology, genetic tractability and 
highly reproducible lineage now allows for a detailed analysis of the assembly of pre¬ 
replication and replication initiation complexes in real time in C. elegans. High-throughput 
studies have already defined RNAi phenotypes for many known DNA replication 
components, and have identified currently uncharacterized genes with similar 
phenotypes. Thus, C. elegans increasingly adds an attractive developmental animal system 
for gene discovery, functional characterizations in vivo, and live imaging of replication 
component localizations. 
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1. Inroduction 

DNA replication, the process of copying double-stranded DNA, and DNA recombination, 
the process of breaking DNA strand and joined to a different strand, are central 
characteristics of life. The aim of this chapter is to discuss the relationship between 
replication and recombination. Understanding the intimate links between these processes 
gives us a more holistic approach to understanding the functioning of a cell. 

Replication and recombination machineries cooperate to maintain biological inheritance and 
genomic integrity. While replication enables the formation of two identical DNA molecules 
from a single double-stranded DNA, recombination enables accurate repair of errors that 
occur on both strands of DNA as well as the formation of new combinations of genes. 
Recombination can occur between similar molecules of DNA (homologous recombination), 
or dissimilar molecules (non-homologous end joining). Homologous recombination 
predominantly occurs during and shortly after DNA replication (late S and G2), while non- 
homologous end joining is predominant in the G1 and early S phase of the cell cycle. 

What is the relationship between DNA replication and recombination processes? Mutations 
in some genes with a role in DNA replication cause hyper-recombination phenotypes. These 
mutations require recombination protein Rad52 for their viability, suggesting that the 
replication errors caused by mutations are repaired by recombination mechanisms (Merrill 
& Holm, 1998). Furthermore, many recombination genes have S phase defects when deleted. 
Obviously, replication and recombination processes are tightly intertwined. In this chapter I 
will try to present the close coupling between these processes from a different point of view 
(Fig. 1): recombination is part of DNA replication and, vice versa, DNA synthesis is part of 
the recombination process; both processes are connected via checkpoints; both processes are 
regulated by common posttranslational modifications; both processes take advantage of 
double helix DNA and both have a common problem with DNA unfolding. 

The recombination system plays a crucial role in DNA replication ensuring that replication 
machines can complete their task of genome duplication. DNA replication forks stall or 
collapse at DNA lesions or problematic genomic regions. When a fork becomes stalled, the 
replisome often remains firmly associated with the fork. But, when replication forks are 
removed, recombination is the most important rescue mechanism. The recombination 
mechanism forms substrates for the assembly of a new replication fork thus allowing 
continued DNA replication. 
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TOPOISOMERASES, 
HELICASES 


COMMON POST-TRANSLATIONAL 
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SUMOILATION, 
PHOSPHORYLATION, 
UBIQUITINATION 



- > - 

Fig. 1. Replication and recombination processes are tightly intertwined: recombination is 
part of DNA replication and, vice versa, DNA synthesis is part of the recombination process. 
They are affected by the common factors: checkpoint, DNA unfolding and post-translational 
modifications. 

DNA synthesis is a crucial step during the recombination process. After Rad51-mediated 
DNA strand invasion, DNA synthesis is the next step in recombination to restore the 
integrity of the chromosome. After DNA strand invasion, dissociation of Rad51 is required 
to allow access for DNA polymerases to prime DNA synthesis at the invading 3' OH end. 
Repair DNA synthesis during the recombination process is similar to normal S-phase 
replication, but has specific properties. 

A close coupling between replication and recombination is also demonstrated in the 
checkpoint system. In addition to sensing DNA damage, the role of the DNA damage 
checkpoint is also to coordinate cell cycle progression, DNA replication, and DNA repair 
mechanisms. When forks collapse during replication and generate single-stranded DNA 
ends, these ends activate the replication checkpoint. Activated replication checkpoint 
proteins further activate the recombination machinery. 

A close coupling between replication and recombination is also demonstrated in common 
posttranslational modifications. Posttranslational modifications, with their key role in 
controlling protein function, play an important role in the maintenance of genome integrity. 
Sumoylation, phosphorylation, and ubiquitination are especially important for the 
functioning of replication and recombination proteins, allowing them to execute their 
functions in a rapid and efficient manner. 

Replication and recombination mechanisms both take advantage of double helix DNA and 
have a common problem with DNA unfolding. Both processes take advantage of the 
complementary base pairing between the strands to undertake their tasks. Each strand of 
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the duplex provides a template for generating the other strand. When one strand of DNA is 
damaged, the complementary sister strand can be used as a template to regenerate the 
damaged strand and recover information encoded in its sequence. However, the 
organization of DNA sequences into a chromatin structure represents a barrier to replication 
and recombination processes. Because replication and recombination require access to the 
sequence in the interior of the double helix DNA, the chromatin structure has to be 
remodeled during these processes. Both processes require DNA topoisomerase to unwind 
the DNA helix and DNA helicase to open the DNA helix. 

This chapter mostly refers to the basic mechanisms applicable to eukaryotes that have been 
studied in budding yeast Saccharomyces cerevisiae. S.cerevisiae is the best studied eukaryotic 
model organism that providing the most integrated view of replication and recombination 
processes. At the end of this chapter, a lesser known yeast protein Ecmll is discussed, 
which affects both replication and recombination processes. 

2. Recombination during DNA replication 

DNA replication represents a sensitive period in the cell cycle because the stretched 
chromatids and the stability of the replicating forks are highly susceptible to damage. 
Failure to restart replication accurately would result in serious consequences for organism: 
accumulation of mutations, enhanced genome instability, or even cell death. Recombination 
plays a crucial role in DNA replication by ensuring that replication machines can complete 
their task of genome duplication. DNA replication forks can stall or collapse at DNA lesions 
or problematic genomic regions. At problematic replication regions, e.g. inverted repeats 
(Ty elements) and tRNA genes, the replication fork progression is slowed down and fork 
stalling is a frequent event. When forks are stalled, the replisome often remains stably 
associated with the fork; Rad53 stabilizes stalled forks by gluing the replisome onto the 
replicating molecule (Lucca et al., 2004). Resumption of DNA replication can occur when the 
block is relieved. In other situations, like DNA breaks or unremoved protein-DNA 
complexes, the replisome dissociates and the fork collapses (Lambert et al, 2005). When 
replication forks are removed, recombination is the most important rescue mechanism. The 
recombination process forms substrates for the assembly of a new replication fork. There are 
also alternatives to a replication restart. Specialized DNA polymerases, translesion 
polymerases, are able to replicate across sites of DNA damage and allow re-initiation of 
replication by normal replicative polymerases beyond the lesion (Friedberg et al., 2002). As 
translesion polymerases are mutagenic, their activation is tightly controlled. It has been 
shown that translesion synthesis is used only when a recombination mechanism is not 
available (Berdichevsky et al., 2002). 

Replication restart by recombination mechanisms is required when stalled replication forks 
are processed up to DSBs or at sub-telomeric regions where there are no converging forks 
that could complete DNA duplication. Cells take advantage of homologous recombination 
(HR) to create substrates for assembly of a new replication fork. Holliday junctions enable 
the site of the original lesion to be returned to a duplex formation and become accessible to 
repair systems. It has been shown that specific nuclear foci of Rad52 are formed during S 
phase and that each Rad52 focus represents a center of recombinational repair capable of 
processing DNA lesions (Lisby et al., 2001). Sister chromatids are the preferred partner for 
DSB repair in mitotic diploid yeast cells. It was confirmed that cohesion between sister 
chromatids is essential for efficient DSB repair in mitotic cells (Uhlmann, 2001). A major 
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damage tolerance pathway involving recombination is the template switch, which uses the 
information on the sister chromatid in the proximity of replication forks. Homologous 
recombination factors and DNA polymerase 6 promote the formation of template switch 
intermediates. These intermediates are dissolved by the action of the Sgsl helicase in 
association with the Top3 topoisomerase rather than by Holliday Junction nucleases (Vanoli 
et al., 2010). Another recombination mechanism, which has been implicated in the restart of 
collapsed forks, is break-induced replication (BIR). BIR is an efficient homologous 
recombination pathway repairing a DNA double-strand break when only one of two ends of 
the DSB succeeds in strand invasion of a homologous sequence. This recombination 
pathway includes the synthesis of a long tract of DNA that can be attributed to 
recombination-directed DNA replication and is discussed in the section about replication 
during the recombination process (section 3.3.). 

2.1 Replication checkpoint is linked to recombination machinery 

DNA damage is constantly being sensed by sophisticated cellular networks - DNA damage 
checkpoints. This is because genome integrity is continuously challenged by DNA lesions, 
DNA single-strand breaks (SSBs) and double-strand breaks (DSBs). DNA damage and 
replication checkpoint proteins monitor genome integrity, recognize different forms of DNA 
damage, transduce appropriate signals, initiate DNA repair, and slow the progression of the 
cell cycle to allow DNA repair to occur (Boddy & Russell, 2001). At critical stages before or 
during DNA replication (Gl/S and intra-S checkpoints) and before cell division (G2/M 
checkpoint), the checkpoint proteins delay or stop the cell cycle if DNA damage is 
unrepaired to prevent duplication and segregation of damaged DNA. A network of 
checkpoint proteins is tightly coordinated with cell cycle progression. Besides its acting in 
cell cycle arrest, the checkpoint response is linked to activation of the repair and 
recombination machinery (Zhou & Elledge, 2000). Thus, the role of the checkpoint system is 
very complex, in that it senses the DNA damage while also coordinating cell cycle 
progression, DNA replication and DNA repair mechanisms. The functioning of replication 
proteins, recombination proteins, and checkpoint proteins is closely intertwined. 

During DNA replication, stalled forks generate single-stranded DNA ends coated with 
replication protein A (RPA), which activates the replication checkpoint. It has been shown 
that a significant amount of ssDNA must be exposed in order to activate the checkpoint 
response (Shimada et al, 2002). Activated replication checkpoint proteins further protect the 
stability of stalled replication forks (Lopes et al., 2001) or activate recombination machinery 
when forks collapse. 

A central transducer of checkpoint proteins is the kinase Mecl (hATR), which is recruited to 
stalled forks and probably activated in response to signals generated by the clamp-sensor in 
the presence of DNA damage (Boddy & Russell, 2001). Activated Mecl phosphorylates 
Mrcl, a protein required for replisome stabilization and checkpoint activation (Katou et al., 
2003). Kinase Tell (hATM) is also a component of the common signalling pathway and has a 
role similar to Mecl. Mecl and Tell both phosphorylate serines and threonines on 
numerous target proteins, e.g. H2A, in response to DNA damage (Downs et al., 2000). 
Downstream from Mecl, the Rad53 kinase responds to DNA replication arrest. Rad53 is a 
conserved protein directly affecting replication fork stability. Mecl and Tell phosphorylate 
Rad53 and Mrcl. Phosphorylated Mrcl binds the fork-associated hyperphosphorylated 
Rad53, leading to its activation (Alcasabas et al., 2001). Kinase Rad53 phosphorylates and 
activates other protein kinases that induce other processes that enable repair of the damage. 
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The similarity between checkpoint sensors and replication proteins has been pointed out 
(Forsburg, 2002). One of the checkpoint complexes is a specialized version of the replication 
clamp-loader RFC, with the large subunit replaced by a checkpoint-specific protein Rad24. 
There is also a checkpoint version of the clamp related to PCNA (processivity clamp of DNA 
polymerases), Mec3-Radl7-Ddcl. Those proteins act as checkpoint sensors of damage and as 
effectors of DNA repair. 

2.2 Sister chromatid cohesion is coupled to DNA replication 

Cohesins are ring-shaped complexes made up of Sccl, Scc3, Smcl, Smc3, and Pds5 subunits. 
Smcl and Smc3 are structural maintenance of chromosome (SMC) proteins with ATPase 
domains. Cohesins keep the sister chromatids connected with each other, they facilitate 
spindle attachment onto chromosomes during metaphase, and their presence is essential for 
the proper transmission of sister chromatids to dividing daughter cells. 

Cohesins between sister chromatids, recombination, and replication are closely intertwined 
processes. The sister chromatid cohesion complex plays a crucial role in DNA damage 
repair by recombination mechanisms, however sister chromatid cohesion is also coupled to 
DNA replication. Cohesins are loaded onto chromatin before DNA replication. 
Establishment of cohesion between sister chromatids is coupled to DNA replication and 
requires the active participation of replication fork and other replication-related activities 
(Carson & Christman, 2001). It has been shown that proper cohesion require PCNA - a 
processivity clamp for DNA polymerases, Pol o - an essential DNA polymerase. Replication 
Factor C (RFC) - a clamp-loader complex, and cohesion establishment factors. It has also 
been shown that fork progression itself is tied to cohesin acetylation (Sherwood et ah, 2010). 
The most important cohesion establishment factor is Ecol/Ctf7 acetyltransferase. The 
mutation of cohesion factor ecol can be suppressed by overexpression of PCNA 
(processivity factor for some DNA polymerases), suggesting that cohesion during the S- 
phase might actually be mediated by replication fork components (Skibbens et ah, 1999). It 
has also been shown that Ecol physically interacts with all RFC complexes (Kenna & 
Skibbens, 2003). As Ecol is essential for cohesion establishment and the RFC complex is a 
replication clamp loader that loads PCNA onto dsDNA and promote processive DNA 
replication, these interactions provide new evidence regarding linkage between cohesion 
establishment and DNA replication. Other links between the replication machinery and the 
cohesion machinery are DNA polymerase Trf4/Polo and components of the replisome 
progression complex (RPC). Because defects in fr/4 mutants result in a failure of cohesion 
between the replicated sister chromatids, it was concluded that Trf4/Polo is required to 
establish cohesion during S-phase (Wang et ah, 2000). 

While the multiprotein cohesion complexes depend on loading and establishment factors 
that are intimately connected to DNA replication, it has also been shown that cohesion 
establishment factors are implicated in maintaining the processivity and stability of 
replication forks (Branzei & Foiani, 2010). Interaction between replication forks and cohesin 
rings stabilize replication forks and/or enable sister chromatid-dependent restart pathways. 
Some of the components, such as the Tofl-Csm3 complex and Mrcl, appear to be involved 
in maintaining fork integrity and are recognized as components of the replication 
checkpoint (Katou et ah, 2003). On the other hand, Tofl-Csm3 complex and Mrcl are 
implicated in sister chromatid cohesion (Mayer et ah, 2004; Xu et ah, 2004). 

In conclusion, DNA replication and cohesion establishment are intimately coordinated 
(Sherwood et ah, 2010): replication forks, travelling along the DNA, modify cohesin's 
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architecture and association with chromatin, and on the other hand, cohesion in turn affects 
the dynamics of DNA replication and keeps the sister chromatids connected to each other. 

3. DNA synthesis during recombination 
3.1 Recombination process 

For the cell, it is essential that DNA breaks are repaired promptly to prevent chromosomal 
aberrations or cell death. Recombination is a key pathway to preserve genomic stability by 
repairing DNA breaks, gaps, and interstrand crosslinks. Recombination is an essential DNA 
repair process, particularly for the repair of DNA-double strand breaks (DSBs). The 
potentially, very dangerous DSBs are repaired by different recombination pathways: non- 
homologous end joining (NHEJ), single-strand annealing (SSA), break-induced replication 
(BIR; outcome is half-crossover), synthesis-dependent strand annealing (SDSA; outcome is 
noncrossover), and double Holliday junction pathways (dHJ; outcome is crossover or 
noncrossover). In yeast cells, DSBs are generally repaired by noncrossover/gene conversion 
pathways. The first two pathways, NHEJ and SSA, can occur within one molecule of DNA. 
DSB resection is a required step in SSA as in HR, but not in NHEJ where DNA end-binding 
factors Ku70-Ku80 inhibit DSB resection (Tomita et ah, 2003). NHEJ is favoured over SSA 
and HR in G1 phase. 

The last three pathways (dHJ pathways, SDSA, BIR) are template dependent processes 
involving another molecule of DNA, a sister chromatid or a chromosome. They are assigned 
to homologous recombination pathways (HR) that require Rad52 and Rad51. In the first 
phase of HR, DNA damage is processed to form a single-strand DNA (ssDNA) region that is 
bound by ssDNA-binding replication protein A (RPA). RPA has an important role during 
recombination. Its primary role is to antagonize secondary structures in the ssDNA and 
favor formation of the active Rad51-ssDNA filament. Besides that function, RPA is able to 
bind the displaced strand in the D-loop and stabilize the invasion intermediate (Eggler et al., 
2002). In the second phase, the Rad51-ssDNA filament performs a homology search and 
invades the donor DNA strand. The displacement loop (D-loop) is formed and within the D- 
loop the invading strand primes DNA synthesis. Motor protein Rad54 enhances D-loop 
formation by Rad51. In the third phase, the pathways of HR are distinguished. Subpathways 
of HR require specific protein cascades that have been only partially defined to date (Heyer 
et al., 2010): 

• When only one homologous DNA strand is found, the D-loop is converted to a 
replication fork and cell choose the BIR pathway of repair. This process enables the 
repair of the chromosome, but leads to loss of heterozygosity of genetic information 
distal to the DSB. 

• When both ends of the break interact with homologous sequences that are used as a 
template for repair, the predominant pathway for DSB repair is SDSA. The D-loop is 
reversed and allows the annealing of the newly synthesized strand with the resected 
strand of the donor end. This pathway avoids crossovers and reduces potential genomic 
rearrangements. 

• Double Holliday junction pathways with crossover or noncrossover outcome are prone 
to noncrossover products in the vegetative cell cycle. Crossovers are primarily intended 
for meiotic formation of new combinations of genes. In vegetative cells, DNA repair is 
infrequently associated with crossovers, because crossovers can lead to deletions, 
inversions, or translocations when non-allelic or repeated sequences are present. 
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3.2 Chromatid cohesion and recombination mechanisms 

Recombination mechanisms use a genetically similar DNA molecule, usually the sister 
chromatid, to repair damaged DNA and maintain the original sequence information. The 
sister chromatid cohesion complex plays an important role in DNA damage repair by 
recombination mechanisms, because cohesin holds the broken chromatid in close proximity 
to its intact sister chromatid, thereby supporting efficient repair. On formation of DSBs, 
cohesin even accumulates and forms new cohesive structures on chromatin that surrounds 
the break site. The ability of cohesin to establish new cohesion structures is essential for its 
role in DNA damage repair, possibly by physically stabilizing fragmented parts of the 
chromosome. Accumulation of cohesin is controlled by the DNA damage response and 
cohesin-regulating factors. Cohesion formation is limited to the S phase of the cell cycle, but 
DSBs can also trigger cohesion after DNA replication has been completed (Strom et al., 
2007). It has been shown that damage-induced cohesion is also essential for recombinational 
repair in postreplicative cells. Cohesion is established genome-wide after induction of a 
single DSB. Thus, a cohesion establishment pathway exists that is independent of DNA 
duplication but required for DNA repair. Cohesin may act as a platform for the recruitment 
of checkpoint and DNA repair proteins and helps to translate checkpoint signals into DNA 
repair. 

3.3 DNA synthesis during the recombination process 

DNA synthesis is a part of the recombination process. After Rad51-mediated DNA strand 
invasion, DNA synthesis is a crucial step in recombination to restore the integrity of the 
chromosome. Motor protein Rad54, which stimulates D-loop formation, also promotes the 
dissociation of Rad51 from the heteroduplex after DNA strand invasion (Kiianitsa et al., 
2006). Dissociation of Rad51 is required to allow access for DNA polymerases to prime DNA 
synthesis at the invading 3' OH end. Repair DNA synthesis during the recombination 
process is similar to normal S-phase replication, but has specific properties: 

• The DNA polymerases required for DNA synthesis in homologous recombination 
remain to be defined; DNA polymerases Pol 5 (Lydeard et al., 2007; Maloisel et al., 
2008) and Pol q (Mcllwraith et al., 2005) have been proposed. 

• Recently it was shown that PCNA, the sliding clamp that locks DNA polymerase onto 
dsDNA and promotes processive DNA replication, has a specific role in the initiation of 
recombination-associated DNA synthesis (Li et al., 2009). It was shown that PCNA is 
required for Pol 5 recruiting to a D-loop and the extension of the 3'-end of the invading 
strand. PCNA and RFC are essential even to synthesize as little as 30 nucleotides 
following strand invasion (Holmes & Haber, 1999). 

• DSB-induced gene conversion does not require lagging-strand components like Pola- 
primase complex and Cdc45, which is responsible for loading the Pola-primase 
complex onto chromatin during DNA replication (Wang et al., 2004). It has been 
proposed that new DNA synthesis is initiated through the leading-strand, using the 3' 
end that was generated by resection, and that Rad51-mediated strand invasion is used 
as the primer end with primase activity. On the other hand, BIR do require lagging- 
strand components and major replicative DNA polymerases. 

• Recombination mediated replication does not require initiation replication proteins or 
additional MCM (Minichromosome Maintenance complex) loading. Normal DNA 
replication requires formation of a pre-RC (pre-replicative complex) in early Gi. Cdc6 
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and Cdtl bind to the chromatin binding ORC (origin recognition complex) and enable 
the recruitment of Mcm2-7. Cdc6 accumulates at the end of mitosis and disappears after 
the initiation of DNA replication, and Cdtl accumulates in the nucleus during Gi and is 
excluded from the nucleus for the rest of the cell cycle to prevent second-round 
replication. Furthermore, Mem proteins are largely excluded from the nucleus in G 2 
phase. Thus, ORC and pre-RC should not be required for DSB repair. This hypothesis 
was confirmed in DSB-induced gene conversion during MAT switching (Wang et al., 
2004). 

• DNA repair synthesis is not as efficient and processive as normal replication. The 
explanation for this reduced efficiency could be the lack of processive Mem helicase. 

3.4 Break induced replication 

DSB repair by homologous recombination most often involves only a short section of new 
DNA synthesis. On the other hand, the BIR homologous recombination pathway includes 
the synthesis of very long tract of DNA; therefore, this pathway can be attributed to 
recombination-directed DNA replication. Break induced replication (BIR) is a homologous 
recombination pathway that is able to repair a DNA double strand break (DSB) when 
homology is restricted to one end. Recombination enables the formation of a unidirectional 
replication fork that leads to extensive replication and copies the donor chromatid to the end 
of the chromosome. 

BIR requires leading and lagging strand replication machinery, all three major replicative 
DNA polymerases, including the otherwise nonessential Pol3 (subunit of Pol5), and 
recombination proteins Rad51 and Rad52. In contrast to other HR pathways, BIR also 
requires Cdc45, GINS, and Mcm2-7 that form the functional replicative DNA helicase; and 
initiation factor Cdtl that recruits Mcm2-7 to the pre-RC (Lydeard et al., 2010). The origin 
recognition complex (ORC) and Cdc6, parts of the pre-replication complex (pre-RC) are not 
required, while initiation replication protein Cdc7 is required for BIR. BIR is an effective 
repair mechanism, characterized by origin-independent and recombination-dependent 
replication. The posttranslational modifications sumoylation and ubiquitination, have a 
special role in BIR. Sizl mediated sumoylation and Radl8-mediated ubiquitination of the 
processivity clamp PCNA are involved in BIR, but not in gene conversion. 

BIR can occur outside of the S phase (Bosco & Haber, 1998), but it is not yet known how 
replication machinery is then formed. In BIR, Mcm2-7 and Cdtl have to cope with a single 
recombination fork, while outside of the S phase they are exported from the nucleus or 
degraded to prevent reinitiation of replication. 

3.5 Replication and recombination in a radiation-resistant organism 

Bacteria Deinococcus radiodurans is one of the most radiation-resistant organisms yet 
discovered. It is capable of surviving extremely high doses of acute (10,000 Gy) and chronic 
(60 Gy/hour) ionizing radiation and it is resistant to other DNA damaging conditions 
including exposure to desiccation, ultraviolet (UV) light, and hydrogen peroxide. It has been 
shown that the extraordinary robustness of D.radiodurans depends on efficient proteome 
protection (Krisko & Radman, 2010). Beside manganese accumulation and specific protein 
protection, survival depends on the efficient and rapid reconstitution of a functional genome 
from numerous DNA fragments. Recombinational repair with extensive DNA synthesis is 
used (Slade et al., 2009). DNA repair is primed by recombination proteins RecA and RadA, 
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while the elongation step is performed by Pol III and Poll. Pol III activity appears to be a 
prerequisite for initiating DNA repair synthesis, whereas Pol I enables its continuation. 
Deinococcus radiodurans has more complementary or homologous DNA fragments to 
engage as repair partners, but otherwise the steps of repair are conspicuously normal 
(Galhardo & Rosenberg, 2009). The key repair mechanism is the combination of long tracts 
of newly synthesized DNA and homologous recombination. 

4. Meiosis 

The meiotic cell cycle in nature does not happen as often as vegetative cell cycle, but is even 
more special, difficult, and controlled. Meiosis is a complex developmental program that 
leads to the generation of haploid spores (gametes) from diploid cells. The central feature of 
meiosis is homologue pairing and meiotic interhomologue recombination that is enabled by 
meiosis-specific factors. After chromosome pairing and recombination, two successive 
segregations occur: MI is a reductional division, in which homologous chromosomes 
segregate and Mil is an equational division, in which sister chromatid segregate. Meiotic 
recombination is essential for proper chromosome segregation at MI where homologous 
chromosomes are synapsed and then distributed to opposite poles. 

4.1 Meiotic replication 

It has been known for a long time that cohesion between homologs, which is essential for 
crossing over, is established during meiotic replication and that the length of meiotic 
replication is modified by recombination factors like Spoil and Rec8. Recombination 
initiation is directly linked to meiotic replication. It has been shown that delaying meiotic 
replication also delays specific meiotic break formation and it has been suggested that DSB 
formation occurs as part of the process initiated by DNA replication (Borde et al., 2000). 
These facts lead to the conclusion that meiotic replication plays a fundamental role in the 
meiotic recombination process. 

The meiotic replication program is generally quite similar to that of the vegetative cell cycle: 
the same origins of replication and the same replicative machinery are used (Budd et al., 
1989). But there are specific features that distinguish mitotic and meiotic replication. The 
most noticeable feature is a protracted meiotic replication that is universally several times 
longer than mitotic replication in the same organisms. The difference has been proposed to 
be related to the process of laying down specialized chromosome features utilized at later 
stages for interhomolog interactions; the progression of the S-phase might be directly 
coupled to this (Cha et al., 2000). Pairing of homologues is a complex process that is 
essential in meiosis but not required in mitosis; thus, it is not surprising that DNA 
replication and preparing for the next phase in meiosis takes more time. 

4.1.1 Mum2 in meiosis 

The study of the Mum2 protein reveals a direct correlation between the level of meiotic DSBs 
and DNA replication (Davis et al., 2001), which was suggested on the basis of the 
observation that a delay in replication results in a similar delay in DSB formation (Borde et 
al., 2000). Mum2 is essential for meiotic replication, but it is not required for mitotic DNA 
replication. Its role in meiotic replication is not clear; however, synthetic interactions with 
the ORC complex and polymerase a-primase suggest that it is involved in the functioning of 
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the replication machinery. It was shown that in mum.2 mutants, chromosome pairing and 
synapsis occur, although at reduced levels compared to wild type, but meiotic 
recombination is greatly impaired. Meiotic gene expression, pairing, and synapsis were only 
partly reduced in mum2 mutants, thus it was suggested that the reduction in DNA 
replication is directly responsible for the reduced level of DSBs and meiotic recombination 
(Davis et al., 2001). 

4.1.2 Ime2 in meiotic replication 

Ime2 is an essential meiotic protein that links DNA replication with the following phases of 
the meiotic cell cycle that follow it. IME2 belongs to early meiotic genes that are regulated 
by Imel, while Ime2 is the key protein for the transcription of the middle genes. The 
transcription of the late genes is indirectly dependent on Imel, Ime2, and Ndt80. The early 
genes are those which encode proteins involved in DNA replication, synapsis of homologs 
and meiotic recombination, whereas the middle genes encode proteins required for nuclear 
division and spore formation. Correlation between meiotic DNA replication and nuclear 
division is regulated by an early meiosis-specific gene. Ime2, which controls both initiation 
of premeiotic DNA replication and the transcription of NDT80, has a key role in the 
correlation between premeiotic DNA replication and nuclear division. It was also 
demonstrated that Ime2 functions as a negative regulator of Clb-Cdc28 (Gurevich & Kassir, 
2010 ). 

4.1.3 Clb5,Clb6-Cdc28 in meiotic replication 

Many proteins with a role in meiotic replication and recombination are phosphorylated 
suggesting that kinases could play an important role in coordinating both processes. Kinases 
Clb5,6-Cdc28 and Cdc7-Dbf4, which initiate meiotic replication and DSB formation could 
have a key coordination role (Wan et al., 2008). 

Clb5,Clb6-cyclin-dependent kinase (Clb5,6-Cdc28) is the principal activator of the S phase in 
the vegetative cell cycle, but the absence of Clb5 and Clb6 confers only a delay in the 
initiation of replication. They are replaced by functionally related B-type cyclins Clbl and 
Clb4 (Schwob et al., 1994). In contrast, Clb5 and Clb6 are essential for the initiation of 
meiotic replication; in the clb5clb6 double mutant, meiotic replication is not detectable 
(Stuart & Wittenberg, 1998). Beside its role in initiating DNA replication, Clb5,6-Cdc28 also 
has a crucial role in the promotion of DSB formation in meiosis. Mutants clb5 and clb5clb6 
but not clb6 are defective in DSB induction and SC formation (Smith et al., 2001). DSBs are 
formed by Spoil protein and accessory factors like Mer2. It has been shown that Clb5,6- 
Cdc28 phosphorylate Mer2 and that this phosphorylation modulates interactions of Mer2 
with other DSB proteins and is critical for DSB formation (Henderson et al., 2006). 

4.1.4 Cdc7 kinase in meiosis 

Cdc7 is an essential kinase that forms complexes with the regulator Dbf4, the levels of which 
fluctuate during the cell cycle. Cdc7 is a key regulator of meiosis, because it is required for 
the initiation of DNA replication, for meiotic recombination, for regulation of the NDT80 
promoter, and for proper recruitment to kinetochores. 

In the vegetative cell cycle, phosphorylation of the MCM complex by Cdc7p-Dbf4 activates 
DNA replication. Loss of Cdc7 function affects initiation at all origins, even at late origins 
(Bousset & Diffley, 1998). An allosteric model has been proposed in which phosphorylation 
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of Mcm2 by Cdc7p-Dbf4 confers an allosteric change on Mcm5p and produces an active, 
'rolling' MCM helicase that can move with the replication fork (Sclafani, 2000). As in 
mitosis, the Cdc7-Dbf4 complex is essential for initiation of DNA replication during meiosis 
(Valentin et al., 2006). 

Besides the role in initiation of meiotic replication, Cdc7 kinase also has a role in the meiotic 
recombination. It has been shown that Cdc7 is essential for meiotic DSBs and meiosis I 
progression. Cdc7 enables phosphorylation of the DSB protein, Mer2. It has been proposed 
that Cdc7 regulates Spoil loading to DSBs by Mer2 phosphorylation (Sasanuma et al, 2008). 
In meosis, Cdc7 kinase also provides the link between replication and chromosome 
segregation. It has been shown that Cdc7 promotes meiotic progression by enabling 
transcription of NDT80; in addition it is required for mono-orientation of sister kinetochores 
in MI by allowing recruitment of the monopolin subunit, Maml, onto kinetochores (Lo et al., 
2008). 

4.2 Recombination in meiosis 

The unique features of meiosis are homologue pairing and inter-homologue recombination 
that is initiated by double-strand breaks. The specific feature of meiotic recombination is the 
presence of meiosis-specific factors that influence the mitotic recombination machinery to 
use the homologous chromosome instead of a sister-chromatid as the recombination 
template and to counteract mitotic favorization of non-crossover outcomes. 

Meiotic recombination occurs during the extended prophase. In contrast to mitotic 
recombination, meiotic recombination is genetically programmed and induced by Spoil- 
catalyzed DSBs. Therefore, in meiosis there is an additional source of DSBs that requires 
active recombination to be resolved. DSBs are resected to expose 3 1 single strand overhangs 
by the Mrell-Rad50-Xrs2 complex and endonuclease Sae2. Overhangs of ssDNA can load 
Rad51 and meiosis-specific recombinase Dmcl to form nucleoprotein filaments that catalyze 
strand invasion on the homologous chromosome (Hayase et al., 2004). In contrast to mitotic 
cells, meiosis-initiated DSBs are repaired using the homologous chromosome, not the sister 
chromatid, as a template for repair. Strand invasion generates DNA synthesis and ligation 
that result in the formation of double Holliday junction structures (dHJ). dHJ generated 
between homologs are resolved by structure-specific endonucleases. Homologous 
recombination products (crossovers and non-crossovers) are formed. While crossovers (Cos) 
absolutely require double a Holliday junction (dHJ) intermediate, non-crossovers (NCOs) 
can occur by synthesis-dependent strand annealing (SDS) or by special cleavage of dHJ. The 
resolution of dHJ depends on structure-specific endonucleases. COs preferentially occur at 
hot spots that are influenced by DNA sequence and higher order chromosome structures, 
but are strongly suppressed in large chromosomal regions near the telomeres, centromeres, 
rRNA genes, and Ty elements (Petes, 2001). COs are associated with reciprocal exchanges of 
chromosome arms and promote accurate chromosome segregation in MI, while non¬ 
crossovers are not. Because of its key role in meiosis, crossover outcome is tightly controlled. 
At least one CO is formed per chromosome pair and a crossover in one region makes it less 
likely that another will be found nearby (crossover interference). The COs/NCOs ratio can 
change to maintain COs. It has been shown that COs tend to be maintained in spoil 
mutants with reduced DSBs at the expense of NCOs (Martini et al., 2006). Spoil is loaded to 
DNA after homolog pairing; therefore, crossover control is likely based in earlier processes. 

It has been shown that noncrossover products are formed at the same time as dHJ 
intermediates, while crossovers appear later, when dHJ intermediates are resolved (Allers & 



44 


DNA Replication and Related Cellular Processes 


Lichten, 2001). ndt80 mutants arrest with unresolved dHJ intermediates and very few 
crossovers, while noncrossover heteroduplex products are formed at normal levels and with 
normal timing. These results suggest that crossovers are formed by resolution of dHJ 
intermediates, while most noncrossover recombinants arise by a different, earlier pathway 
and that dHJ resolution is under control of meiosis-specific transcription factor Ndt80. 
Noncrossover recombinants are most probably formed by SDSA, where the invading strand 
is not captured but is instead displaced. The other possibility is that dHJ intermediates are 
differentiated at an early stage and intermediates of noncrossovers are resolved very 
rapidly. The latter pathway of noncrossover formation through dHJ is less possible and cells 
probably use it only as an alternative pathway when crossovers could not be carried out. 

4.2.1 ZMM proteins 

ZMM proteins are components of SC (Zipl, Zip2, Zip3, Zip4), mutS homologues (Msh4, 
Msh5), and meoisis specific DNA helicase Mer3 (Borner et al., 2004). ZMM proteins play a 
crucial role in crossover formation; despite efficient initiation of recombination, crossovers 
are reduced or absent in zmm mutants. Elimination of crossovers is not due to reduced DSBs 
formation, because DSBs accumulate to wild-type levels in zmm mutants. The formation of 
NCOc is independent of ZMM proteins. Thus, ZMM are specifically required for formation 
of COs. ZMM proteins interact with recombination proteins, e.g. Mrell, Rad51, and Rad57 
(Agarwal & Roeder, 2000). In meiosis there are two types of COs: one type of CO that is 
dependent on the activities of ZMM proteins and a second type of CO that is ZMM- 
independent. COs are not randomly assigned, but mostly they do not occur in adjacent 
chromosomal regions; this property is called CO-interference. The ZMM-dependent COs 
represent the majority of COs in yeast and are subject to interference, while ZMM- 
independent COs do not exhibit interference. 

ZMM proteins are essential for the assembly of SC formation; Zipl is an integral component 
of the SC. They play an important role in coordination between molecular recombination 
events and assembly of the SC. 

The exact function of ZMM proteins in meiosis still remains to be clarified; although they 
probably affect strand invasion. 

4.2.2 Synaptonemal complex 

Meiotic recombination depends on the development of the meiosis-specific chromosome 
structure synaptonemal complex (SC). SC is a protein network that pairs homolog 
chromosomes along their entire length and holds them together. The cohesion complex 
consists of one filament (lateral elements) on each homologue and central elements linking 
lateral elements a in ladder like structure. Central elements contain various proteins, 
including Zipl (Sym et al, 1993) and SUMO (Cheng et al., 2006). Lateral elements contain 
meiosis-specific proteins, e.g. Redl, Hopl, Mekl/Mre4, and a cohesion complex that is 
similar to the mitotic sister chromatid cohesion complex, only that SCC1 is replaced by Rec8 
in meiosis (Xu et al., 2005). It has been shown that meiosis-specific kinase Mekl suppresses 
meiotic intersister DSB repair by working directly on sister chromatids and that Mekl acts 
on Rad51-specific recombination processes (Callender & Hollingsworth, 2010). 

4.2.3 Cdc28 in meiotic recombination 

Most components of SCs including Zipl, Hopl, Redl, Mre4, and Rec8 are phosphoproteins, 
although the functional significance of this phosphorylation remains to be characterized 
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(Zhu et al., 2010). It was shown recently that phosphorylation of Redl is carried out in 
Cdc28-dependent and Cdc28-independent manners and that Redl phosphorylation is 
independent of meiotic DNA recombination (Lai et al., 2011). Cdc28 is the only CDC (cyclin 
dependent kinase) in S.cerevisiae and plays the main role in cell cycle regulation. From G1 to 
S phase and in S phase Cdc28 is bound to B-type cyclins Clb5,6-Cdc28 and this complex 
plays a role in entry into the pre-mitotic and pre-meiotic S-phase. But in meiosis, the Clb5,6- 
Cdc28 complex also has a role in the recombination process. It regulates DSBs formation by 
phosphorylation of Mer2, a protein that is bound to Spoil (Henderson et al., 2006). Mrell- 
Rad50-Xrs2 complex and Sae2 endonuclease are necessary to remove the covalently attached 
Spoil from the DNA ends. It was shown that phosphorylation of Sae2 by Cdc28 is required 
to initiate meiotic DSB resection by allowing Spoil removal from DSB ends (Manfrini et al., 
2010). Cdc28 is required for the processing of DSBs by providing a mechanism for 
coordinating DSB resection with progression through the meiotic prophase. But localization 
of Clb5,6-Cdc28 is independent of DSB formation and is rather dependent on meiosis- 
specific chromosome components such as Redl, Hopl, and a cohesin subunit Rec8. Recently 
it was found that compromised Cdc28 activity in the meiotic prophase leads to defective SC 
formation without affecting DSB formation suggesting that CDK-dependent 
phosphorylation regulates formation of SC (Zhu et al., 2010). Clb5,6-Cdc28 obviously 
promotes not only the onset of premeiotic DNA replication but also the formation of meiotic 
DSBs and SC formation. 

4.2.4 Ime2 in meiotic recombination 

During meiosis, Ime2, a meiosis-specific protein kinase, assumes some function of the Cdc28 
complexes. Ime2 is required for critical events in meiosis, including DNA replication 
(section 3.1.2.). Transition from Gi to S phase is prohibited by the Clb5,6-Cdc28 inhibitor Sicl 
in both mitosis and meiosis. In mitosis, inactivation of Sicl by phosphorylation is catalyzed 
by the Clnl,2-Cdc28 complex (Nash et al., 2001), while in meiosis it is catalyzed by Ime2 
(Dirick et al., 1995). Ime2 is obviously required for the initiation of DNA replication in 
meiosis. On the other hand, Ime2 interacts with RPA, a complex required for stabilization of 
ssDNA during replication and recombination. In meiosis, Ime2 is required for full RPA 
phosphorylation (Clifford et al., 2005), thus Ime2 affects meiotic recombination processes as 
well. 

4.2.5 Kinases Mecl and Tell in meiotic recombination 

Mecl (hATR) is an essential protein that mediates two important functions: S-phase 
checkpoint responses (2.1.) and meiotic recombination. Kinases Mecl and Tell 
phosphorylate the axial element protein Hopl and promote meiotic recombination using an 
intact homologous non-sister chromatid rather than a sister chromatid (Carballo et al., 2008). 
Without this specific phosphorylation, meiotic DSBs are repaired via a Dmcl-independent 
intersister repair pathway, resulting in diminished interhomolog crossing-over. 
Phosphorylation of Hopl by Mecl/Tell is required for activation of Mekl, a meiotic 
paralogue of the DNA-damage effector kinase, Rad53p. 

It was proposed that Mecl and Tell also modulate the switch from the open to closed state 
of telomeres (Dubrana et al., 2001). Telomeres are not elongated every cell cycle. They can 
switch to an open state for telomerase recruitment. 
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4.2.6 Mrell in meiotic recombination 

The Mrell-Rad50-Xrs2 complex is required for the initiation of meiotic recombination - for 
programmed DSBs catalyzed by Spoil and for break end resection, which is a key step in 
homologous recombination. It has been proposed that Mrell assembles on the DSB sites 
and ensures a link between DSB formation and the processing of break ends (Borde et al., 
2004). 

4.2.7 Spoil and Rec8 in meiosis 

The Spoil protein is the catalytic subunit of the meiotic DSB transesterase and is directly 
responsible for the initiation of meiotic recombination. Rec8 is a meiosis-specific protein 
required for meiotic sister chromatid cohesion and meiotic recombination. It has been 
shown that recombination proteins Spoil and Rec8 also play distinct roles in meiotic DNA 
replication (Cha et al, 2000). They modulate the progression of meiotic replication: the 
mutation of spoil decreases the length of the S phase, while mutation of rec8 increases the 
length of the S phase. It was proposed that meiotic S-phase progression is linked directly to 
the development of specific chromosomal features required for meiotic interhomolog 
interactions. During the process of meiotic replication, the future DSB sites are probably 
marked. The time delay between replication and DSBs may reflect the time required to 
assemble the proper protein complexes at the target sites. 

5. Common problem: DNA unfolding 

Organization of DNA sequences into a chromatin structure represents a barrier to processes 
acting on DNA, like replication and recombination. During these processes chromatin 
structure has to be remodeled and reestablished after DNA replication and repair. 
Replication and recombination need access to the sequence in the interior of the double helix 
DNA and both require DNA topoisomerase to unwind the DNA helix, DNA helicase to 
open the DNA helix, and ssDNA-binding protein to protect single stranded DNA. 

5.1 DNA topoisomerase and DNA helicase 

Topoisomerases are enzymes that are able to unwind and wind DNA. They bind to DNA 
and cut the phosphate backbone of the DNA, the DNA molecule can then untangle, and the 
untangled DNA is reconnected. Thus, their function in replication, transcription, and 
recombination is essential. In S. cerevisiae, there are three types of topoisomerases: Topi and 
Top2 that relax positively and negatively supercoiled DNA, and Top3 that relaxes single- 
stranded negatively-supercoiled DNA. 

Helicases are able to separate strands of the DNA double helix using energy from ATP 
hydrolysis and are also required for replication, transcription, and recombination. Helicases 
Sgsl, Srs2, Rrm3, Dna2, and Rad3 form a complex interacting network with other DNA 
replication and recombination proteins to preserve genomic integrity. Sgsl and Srs2 are 
involved in genome integrity maintenance and meiotic recombination; Rrm3 is involved in 
telomere and rDNA replication, and Tyl transposition; nuclease and helicase Dna2 is 
required for Okazaki fragment processing and DNA repair; Rad3 is involved in nucleotide 
excision repair and transcription (SGD; http://www.yeastgenome.org/). 

Sgsl is a helicase of the RecQ family, similar to human BLM and WRN proteins that are 
implicated in Bloom and Werner syndromes. Sgsl interacts with many proteins and has 



The Relationship Between Replication and Recombination 


47 


very complex function, which is a good example of the cooperation between replication and 
recombination. 

• Sgsl and Srs2 helicases control distinct pathways of HR during DNA replication and 
the restart of stalled replication forks. Recombination-dependent cruciform structures 
are formed on damaged chromatids and Sgsl and Srs2 counteract their formation 
(Liberi et al., 2005). It was shown that the double mutation sgslsrs2 leads to a synthetic 
growth defect and that a mutation in rad51 fully rescues the sgsl srs2 synthetic defect 
(Gangloff et al., 2000). It was therefore concluded that defects in sgsl srs2 mutants are 
caused by misregulated HR during DNA replication. 

During regulation of HR, Sgsl functions in a complex with Top3 and Rmil, a DNA- 
binding protein with a preference for cruciform structures (Mullen et al., 2005). The 
Sgsl-Top3-Rmil complex is crucial for resolution of recombination intermediates such 
as Holliday junctions. The complex interacts with recombination protein Rad51. It was 
shown that Rad51 helps recruit Sgsl-Top3 to sites of replicative damage (Shor et al., 
2002 ). 

• During meiosis Sgsl negatively regulates not all HR mechanisms, but specifically 
crossovers (Rockmill et al., 2003). The mutation sgsl leads to an increase in closely 
spaced crossovers without an increase in non-crossover products (Oh et al., 2007). The 
anti-crossover activity of Sgsl is opposed by the pro-crossover activities of the ZMM 
proteins. Srs2 play a similar role as Sgsl. Double mutants sgslsrs2 increase crossovers 
and overexpression of SRS2 nearly eliminates crossovers (Ira et al., 2003). Sgsl and Srs2 
obviously regulate the processing of recombination intermediates during DNA 
replication and promote non-crossover products. 

Sgsl also functions in a Sgsl-Top3-Rmil complex during meiosis. Mutants spoil that 
have no recombination between homologous chromosomes, still form viable spores if 
there is also a top3 mutation (Gangloff et al., 1999). This result indicates that Top3 is 
required to complete recombination successfully. 

• Sgsl has a direct role in the DNA damage response during mitotic DNA replication. 
Sgsl is one of the components of the S-phase checkpoint response, which senses DNA 
damage or a blocked fork progression during DNA replication (Frei & Gasser, 2000). 
Sgsl associates with signal-transducing kinase Rad53 in S-phase-specific foci. 

• Sgsl interacts with the Top2 topoisomerase, which is the major mitotic post-replication 
decatenase. In top2 mutants, chromosomes segregate improperly, leading to 
chromosome loss and disomy. Because Sgsl acts along the same pathway as Top2, it 
has been suggested that Sgsl-Top2 is important for the decatenation of sister 
chromosomes (Watt et al., 1995). In meiosis, the Sgsl-Top2 complex has the similar role. 

5.2 ssDNA-binding protein 

Replication protein A (RPA) binds strongly and in long clusters to ssDNA and form a 
nucleoprotein filament. Because of its protective function on ssDNA ends, RPA has a central 
role in DNA repair, meiotic recombination, and also in replication checkpoints. 

RPA recruits checkpoint kinase Mecl (hATR) to ssDNA. Such activated Mecl further 
phosphorylates its substrates during the DNA damage checkpoint response. It has been 
proposed that RPA is needed to determine the severity of DNA damage and also to initiate a 
full checkpoint response (Pellicioli et al., 2001). DNA damage and the extent of ssDNA are 
probably monitored by the binding of the RPA to ssDNA (Lee et al., 1998). 
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6. Post-translational modifications 

Post-translational modifications, with their key role in controlling protein function, play an 
important role in the maintenance of genome integrity. Sumoylation, phosphorylation, and 
ubiquitination are especially important for the operation of replication and recombination 
proteins in order for them to execute their functions in a rapid and efficient manner. While 
ubiquitination leads in most cases to proteasomal degradation of the target protein, there are 
also many non-proteolytic functions of ubiquitin (Ulrich & Walden, 2010). Monoubiquitylation 
is not directed at proteolysis; it often reveals a new binding domain. Polyubiquitin chains that 
are linked through distinct Lys residues of ubiquitin also have functions that are independent 
of proteolysis; they might activate kinases or provide protein interaction changes (Hicke et al., 
2005). Non-proteolytic functions of monoubiquitylation and polyubiquitylation are 
particularly important in the control of the DNA damage response and DNA repair pathways. 
The small ubiquitin-like modifier (SUMO), which modifies proteins by regulation of their 
protein-protein interactions, allows the assembly and disassembly of protein complexes to be 
modulated. Sumoylation and ubiquitination both occur as the result of sequential action of 
specific enzymes: activating enzyme (El), conjugating enzyme (E2), and ligase (E3). SUMO 
and ubiquitin are covalently attached to the target protein; sometimes it is the same lysine 
residue of a target protein that can be alternatively modified by either SUMO or ubiquitin. 
Phosphorylation, the addition of a phosphate group (P04) to a target protein, results in a 
conformational change in the protein and consequently leads to its altered activity. Reversible 
phosphorylation is catalysed by kinases (phosphorylation) and phosphatases 
(dephosphorylation). Beside its central post-translational role, phosphorylation is also a 
regulator of the SUMO pathway (Hietakangas et al., 2006). 

Here are some examples of post-translational modifications of replication and 
recombination factors: 

• RecQ helicase Sgsl is specifically sumoylated under the stress of DNA double strand 
breaks (Lu et al, 2010). SUMO together with the Sgsl-Top3 complex counteracts the 
accumulation of crucial intermediates at replication forks during replication resumption 
processes (Branzei et al., 2006). Sumoylation of Sgsl at K621 is uniquely required for its 
role in telomere-telomere recombination for Type II cells. 

• Repair proteins yKu70, Smc5, and Smc6 are modified by SUMO (Zhao & Blobel, 2005). 
Smc5 is sumoylated by Mms21 SUMO ligase in response to DNA damage and forms a 
complex with Smc6 that has a role in DNA repair and segregation of repetitive 
chromosome regions (Cost & Cozzarelli, 2006). 

• In the NHEJ pathway the Nejl protein recruits anti-recombinogenic helicase Srs2 to 
DSB repressing HR and favoring NHEJ or SSA. Interaction between Nejl and Srs2 is 
enhanced by phosphorylation of Nejl (Carter et al., 2009). 

• Phosphorylation of Sae2, a nuclease that regulates DNA end resection in mitotic and 
meiotic cells, determines whether a DSB is channelled into NHEJ or HR (Huertas et al., 
2008). 

• Nuclease Exol that generates ssDNA is regulated by phosphorylation. Rad53- 
dependent phosphorylation reduces Exol activity (Morin et al., 2008). 

• Rad51 has a central role in HR. Human Rad51 is phosphorylated in response to HU. Its 
phosphorylation has an activating role (Sorensen et al., 2005). 

• Rad52 is sumoylated after DNA damage and during meiosis and this sumoylation is 
essential for activating HR and SSA (Sacher et al., 2006). Rad52 is excluded from the 
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nucleolus containing rDNA repeats to diminish the negative effect of HR in repetitive 
sequences. Cells with mutated Rad52, which cannot interact with SUMO, form foci 
within the nucleolus, resulting in slightly elevated rDNA recombination (Torres-Rosell 
et al., 2007). 

• Several components of the replication fork including Mrcl are phosphorylated after 
fork stalling, including Mrcl, which plays roles in both the S phase checkpoint and at 
the replication fork. Phosphorylation of Mrcl is required for activation of the Rad53 
kinase during the S phase checkpoint but is not required for DNA replication (Lou et 
al, 2008). 

• Phosphorylation of histone H3 has a role in DNA replication. Phosphorylation of H3 
Thr45 is mediated by the kinase Cdc7-Dbf4. It has been shown that Thr45 
phosphorylation peaks during DNA replication and that the loss of this 
phosphorylation site causes phenotypes consistent with replicative defects (Baker et al., 
2010 ). 

• Topoisomerase Top2 cleavage and re-sealing of the phosphodiester backbone is 
required for replication and recombination processes. It has been shown that 
sumoylation is essential for Top2 functioning (Bachant et al., 2002). 

• Kinetochore proteins NdclO, Birl, Ndc80, Cep3 were also shown to be sumoylated 
(Montpetit et al., 2006). 

Of particular interest are the post-translational modifications of proteins like PCNA that 
directly connect replication and recombination processes. Sumoylation and different types 
of ubiquitination on PCNA activate a different set of functions to deal with particular types 
of replication stresses: 

• PCNA is essential in DNA replication and DNA repair. It is loaded on chromatids and 
functions as a processivity factor for DNA polymerases and other proteins that regulate 
replication-associated processes. Sumoylation and ubiquitination of PCNA both occur 
in the S phase: sumoylation during normal S-phase and ubiquitination following DNA 
damage. Sumoylation of PCNA prevents unwanted recombination repair during DNA 
replication. PCNA sumoylation favors physical interactions with anti-recombinogenic 
helicase Srs2 and replication forks (Papouli et al., 2005; Pfander et al., 2005). Helicase 
Srs2 inhibits recombination by disrupting Radi nucleoprotein filaments from ssDNA 
(Veaute et al., 2003). DNA damage induces mono- or poli-ubiquitination of PCNA. 
Monoubiquitination of PCNA favors interaction with translesion DNA polymerases 
and promotes error-prone translesion synthesis, while polyubiquitination on the same 
residue promotes error-free damage repair, which relies on recombination mechanisms 
(Hoege et al., 2002). Poliubiquitination with recombination repair is favored over 
monoubiquitination. 

7. Telomere: DNA replication can be replaced by recombination 

Telomeres are nucleoprotein structures at the ends of eukaryotic chromosomes. They play a 
crucial role in maintaining genomic stability by providing chromosome-end protection. 
Telomeric DNA with repetitive guanine-rich DNA sequences is able to form protective t- 
loop and G-quartet structures; and together with interacting specific proteins, form a 
protective 'cap' that distinguish telomeres from DNA double-strand breaks (Simonsson, 
2001 ). 
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Telomeres are duplicated by special DNA replication using a specific DNA polymerase, 
telomerase. When this special telomere replication is unavailable, DNA recombination at 
telomeres can be replaced by recombination mechanisms. 

7.1 Telomere DNA replication 

Because of single-stranded 3' overhangs and bound proteins, telomeres are specifically 
replicated by telomerase. Telomerase is a DNA polymerase that is specifically dedicated to 
the replication of chromosome ends. It is composed of several components: a reverse 
transcriptase catalytic subunit - RNA that provides a template for DNA synthesis, an Est2 
catalytic subunit, and additional associated proteins, Estl and Est3. Cdcl3 ssDNA telomere 
binding protein is a key factor in telomerase recruitment. The telomere binding Ku 
heterodimer (Ku70,80) with its role in the maintenance of telomere DNA structure also plays 
a positive role in the Cdcl3p-dependent recruitment of telomerase. It also stimulates 
telomerase activity after telomerase is bound to the telomere. The telomere binding protein 
Rapl is involved in telomere length regulation and telomeric silencing; Raplp in complex 
with Rifl and Rif2 forms a higher-order structures that is part of a system for telomere 
length measurement (Ji et al., 2008). 

Telomerase activity has been detected throughout the entire cell cycle, but telomere 
elongation is restricted to late S phase when replication machinery is available. It has been 
shown that telomere elongation is coincident with normal DNA replication; telomerase 
action is tightly co-regulated with conventional replication (Diede & Gottschling, 1999). 

As regards replication of sub-telomeric regions, there are no converging forks that would 
complete replication when forks collapse. In sub-telomeric regions, DNA repair mechanisms 
are absolutely required to restart the collapsed fork. 

7.2 Telomere maintenance by recombination 

Without telomerase, telomeres shorten with replication and signal cell senescence. The final 
consequence of telomere erosion is the loss of capping proteins, which leaves telomeres 
exposed to DNA repair mechanisms, potentially inappropriate recombination events or end- 
to-end fusion (Blackburn, 2000). However, rare survivors bypass senescence and death. 
Homologous recombination (HR) could slow senescence and support the growth of the rare 
survivors. HR is suppressed where telomere sequences are of normal length; however, 
telomere recombination is allowed when telomeres erode in the absence of telomerase 
activity or incomplete replication. Most human malignant cells become immortalized by 
telomere extension due to the activation of telomerase, while about 10% of human cancers 
activate a recombination-mediated elongation, the so-called alternative lengthening of 
telomeres (ALT). Recombination mechanisms in budding yeast that enable telomere 
elongation in the absence of telomerase are analogous to ALT (Lydeard et al., 2007). There 
are two types of survivors when telomerase is absent and both depend on the Rad52 
recombination protein (Chen et al., 2001). Type I cells exhibit the amplification of Y' 
sequences and have very short telomere repeats on the chromosome ends. They depend on 
Rad51, Rad54, Rad55, Rad57 proteins, like DSB repair by HR. Type I cells show accelerated 
loss of viability. Type II cells exhibit extremely elongated and heterogeneous telomere 
repeats and only modest Y 1 amplification. They depend on Rad50, Mrell, and Xrs2 proteins, 
like DSB repair by SSA. Type II cells show a slower rate of senescence. It was shown that 
Type II cells with long telomere repeats and dependence on Rad50, resemble human ALT 
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cells (Dunham et al., 2000). Type II cells are as efficient as wild type in maintaining cell 
survival. Telomere recombination is as efficient as telomerase, but cells using recombination 
for telomere maintenance instead of replication by telomerase had a shortened replicative 
life span or accelerated cellular aging (Chen et al, 2009b). In the absence of telomerase and 
both recombination pathways, cells fail to produce survivors. 

It has been shown that short telomere is the preferred substrate of telomerase and is also 
sufficient to recruit recombination proteins; on the other hand, ssDNA at telomeres does not 
appear to be sufficient to attract recombination proteins (Pennaneach et al., 2006). The 
Rad52- Cdcl3 focus is found to colocalize with short telomeres in 50% of cells. It has been 
proposed that a burst of resection at a single telomere marks the transition between 
recombination-resistant and recombination-prone states at telomeres. 

Telomeres bind numerous DNA repair and damage checkpoint proteins as DSBs, but they 
activate neither DNA repair nor DNA damage checkpoint pathways. Paradoxically DNA 
repair and checkpoint proteins do still play critical roles in telomere stability (Lydall, 2003). 

8. Ecmll, yeast protein with a role in replication and recombination 

Ecmll is a protein of the budding yeast S.cerevisiae. The predicted ECM11 gene product has 
302 amino acids and shows no significant similarity to any other protein in the databases. 
From the microarray hybridization data available in the Stanford Genome Database (SGD; 
http://www.yeastgenome.org/), it is apparent that the ECM11 transcript does not oscillate 
during the cell cycle, but is significantly elevated during meiosis. It was confirmed that the 
level of protein Ecmll is in accordance with the mRNA: the level of Ecmll protein is low in 
mitosis, but high in meiosis (Zavec et al., 2008). The highest level of Ecmll is in the early- 
middle phase of sporulation. The central, unique features of meiosis are homologue pairing, 
interhomologue recombination, and synaptonemal complex formation. These regulated 
events are expected to use meiosis-specific regulators and this was the supposed role of 
Ecmll. In our previous work we have demonstrated that Ecmll is indeed required in 
meiosis (Zavec et al., 2004). Mutants ecmllA exhibit complex defects in meiotic replication 
and recombination: diminished meiotic DNA synthesis, reduced crossing over, increased 
gene conversion events and reduced levels of sporulation and spore viability (Fig. 2). 

8.1 Ecmll has important role in meiotic recombination 

ECM11 showed an early to middle pattern of induction in meiosis. There are three groups of 
meiotic genes with the nearly identical expression pattern as ECM11: the meiotic 
recombination genes such as MND1 and REC114, the synaptonemal complex genes such as 
RED1 and MSH4, and the meiotic specific cohesion gene REC8 (SGD; 
http://www.yeastgenome.org/). Such an expression pattern of ECM11 indicates that Ecmll 
could have a role in meiotic recombination process. It has been shown that ecmll 
homozygous diploid strains sporulate more slowly and less efficiently than wild type 
strains and that the viability of spores was reduced to 50% (Zavec et al., 2004). Using 
recombination tests, it was shown that ECM11 is required for crossing over, but not for gene 
conversion. 

8.1.1 The ecmll mutation affects sporulation efficiency 

In the homozygous ecmll mutants, a delay in sporulation was observed and there was 
about a 30% reduction in the maximal level of asci obtained, compared to the isogenic wild 
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Fig. 2. The role of yeast protein Ecmll: mutants ecinllA exhibit complex defects in meiotic 
replication and recombination: diminished meiotic DNA synthesis, reduced crossing over, 
increased gene conversion events and reduced levels of sporulation and spore viability. 

type strain. Heterozygous strains yielded slightly higher sporulation efficiency ecmll 
mutants. Wild type strains carrying additional ECM11 on the centromeric plasmid also 
showed reduced sporulation efficiency comparing to wild types. Obviously, sporulation 
efficiency depends on the copy number of Ecmll protein in the cell during meiosis. Since 
more Ecmll than usual in the cell leads to lower sporulation efficiency, Ecmll is probably a 
part of a heterologous protein complex that requires the correct balance among these 
proteins. 

The kinetics of the sporulation process shows that spores appeared about four hours later in 
ecmll mutants than in wild type. Both strains complete sporulation at the same time, but 
ecmll strains produce lowers level of asci. Since deletion of ECM11 resulted in reduced 
meiotic process progression with reduced maximal level of asci, Ecmll was defined as a 
positive effector of meiosis. 


8.1.2 Ecmll is required for crossing over, but not for gene conversion 

Asci of the parental MAS and homozygous ecmll mutants were dissected and compared. In 
the parental strain, 98% of spores germinated after dissection, while only 50% spores of the 
ecmll mutants survived. Beside strongly reduced spore viability in the ecmll mutants, a 
strongly elevated fraction of tetrads exhibiting two viable spores in ecmll mutants was 
observed (Zavec et al., 2004). The majority of ecmll asci (56%) produced only two viable 
spores, while only 1% of such asci were observed in the parental strain. This result shows 
non-disjunction of homologous chromosomes at the first meiotic division. 
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Crossing over frequency was measured in two-viable-spores asci in the ecmll strain. In 84% 
of asci both surviving spores had the same URA phenotype, indicating that both functional 
spores inherited sister chromatids. Obviously, ECM11 is required for crossing over. 
Additionally, since in meiosis I homologous chromatids are separated, the effect of Ecmll in 
fidelity of chromosome segregation can be confined to meiosis I. Surprisingly, the results of 
the return-to-growth experiment showed slightly increased gene conversion events in ecmll 
mutants. This result raises the possibility that ecmll mutation impairs the crossover process 
at an early step of recombination, at the differentiation of intermediates into crossovers or 
non-crossovers. 

The ECM11 gene is required to produce a normal level of crossover, but not noncrossover 
events, just like genes required for synaptonemal complex formation (ZIP1, ZIP2, ZIPS), 
mutS homologues ( MSH4 , MSH5), some mismatch repair genes ( MLH1, MLH3, EXOl) and 
meoisis specific DNA helicase MER3 (Borner et ah, 2004). So, Ecmll must belong to the 
specific set of proteins that are required for crossing over. Mutants of some of those genes 
(msh5, mer3, zipl, zip2, zip3 ) exhibit normal levels of double-strand breaks and noncrossover 
products but strong coordinate defects in single-end invasions, double Holliday junctions 
and crossover products, thus implying that those mutants are specifically defective in the 
formation and functioning of single-end invasions (Borner et ah, 2004). 

As proper segregation of homologous chromosomes during the reductional division of 
meiosis does not require gene conversion, but does require crossing over to establish 
chiasmata, which physically connect homologues after disassembly of the synaptonemal 
complex, inappropriate segregation of chromosomes in ecmll mutants could be due to 
reduced crossing over in those cells. 

8.2 ECM11 is required for meiotic DNA replication 

To determine if deletion of ECM11 influences meiotic replication, the DNA content of yeast 
cells was analysed by flow cytometry. FACS analysis show that the fraction of ecmll 
mutants entering S phase is reduced compared to the wild type. The ecmll mutants enter S 
phase at the same time as wild type cells, but it is less efficient in the knockout strain. The 
replication process was the most intense between the second and the fourth hour of 
sporulation. In this period, the percentage of 4N cells in the wild type cell population 
increased by 36%, while in the mutant cells this increase was only 18%. The hindrance of 
DNA replication caused by the deletion of the ECM11 gene may be due to many reasons: the 
lower efficiency of origin firing, the lower progression of DNA replication or as a 
consequence of modifications in some other meiotic process. Based on our FACS data we 
cannot exclude any of these possibilities. Since, in all strains tested, deletion of the ECM11 
gene has no impact on generation time during vegetative growth, we assume that the effect 
of the ecmll mutation on S phase is limited to meiosis. 

In the two-hybrid screen it was found that Ecmll strongly interacts with an essential protein 
Cdc6, which has a pivotal role in the initiation of DNA replication (Zavec et ah, 2000). 
Genetic interactions between Cdc6 and Ecmll were also observed. Moderate suppression of 
the cdc6-l mutation by overexpression of ECM11 was detected (Zavec et al, 2000) and 
deletion of ECM11 in cdc6-l genetic background enhances thermosensitivity of the cdc6-l 
mutation (unpublished result). In addition, ECM11 shows synthetic lethality with CDC8, a 
gene coding thymidylate kinase, which is required for DNA synthesis (Tong et al., 2004). 
These data suggest the direct involvment of ECM11 in DNA replication. 
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In meiosis, Cdc6 is required for premeiotic DNA replication as in the vegetative cell cycle, 
but the origin-bound Cdc6 is protected from degradation and occupies origins throughout 
the meiotic cycle (Ofir et al., 2004). Origin-bound Cdc6 could reflect a change in chromatin 
structure that occurs in meiosis and Ecmll bound to Cdc6 could be a part of these changes. 

8.3 Ecmll is modified by SUMO during meiosis 

8.3.1 Sumoylation status of Ecmll during meiosis 

In a wide search of protein-protein interactions it was found that Ecmll interacts with 
SUMO (Smt3) in the two-hybrid system (Ito et al., 2001; Yu et al., 2008). The Ecmll protein 
has two lysine residues, K5 and K101, with a corresponding surrounding sequence IKTE 
that could accept SUMO. It was confirmed by immunoprecipitation that Ecmll is 
sumoylated during meiosis and that Ecmll interacts with SUMO covalently (Zavec et al., 
2008). The majority of Ecmll protein in the cell is sumoylated during meiosis. The HA 
antibodies recognized the HA-tagged Ecmll, as well as additional ~10 and ~20 kDa larger 
species, which correspond to the molecular mass of one or two copies of mature SUMO. 
Sumoylation of Ecmll was confirmed by anti-Smt3 reactivity on anti-HA 
immunoprecipitated samples. Multiple sumoylation was already observed for other 
sumoylated proteins, e.g. Rpbl, the largest subunit of RNA polymerase II (Pol II) (Chen et 
al, 2009a). 

8.3.2 Sumoylation of Ecmll is essential for Ecmll functioning in meiosis 

The importance of Ecmll sumoylation for progression through meiosis was investigated by 
studying the effect of mutations of predicted SUMO consensus sites in Ecmll on 
sporulation efficiency (Zavec et al., 2008). Lysines K5 and K101 in the predicted SUMO 
consensus sites were mutated to the uncharged amino acid asparagine. Mutation of the 
predicted sumoylation site K5 affects the biological function of Ecmll in meiosis. Mutation 
of K5 led to a reduction of sporulation to the same level as that seen in the mutant with the 
deleted ECM11 gene, while mutation of K101 did not affect sporulation level. These results 
suggest, firstly, that the Lys5 at N-terminus of Ecmll is modified by SUMO during meiosis 
and, secondly, that sumoylation is essential for the biological role of Ecmll in meiosis and 
directly regulates Ecmll function in meiosis. 

8.3.3 Ecmll is probably modified by phosphorylation 

As described previous (section 5.), beside sumoylation, phosphorylation is the most 
important modification for replication and recombination proteins. Most components of SCs 
including Zipl, Hopl, Redl, Mre4, and Rec8 are phosphoproteins, but the functional 
significance of this phosphorylation and the kinase responsible for the phosphorylation 
remains to be characterized (Zhu et al., 2010). Ecmll also contains many predicted 
phosphorylation sites. 

9. Conclusion 

In this chapter connections between replication and recombination processes are presented. 
Studying of proteins that affect both processes will further clarify the relationship between 
these two processes. One of such proteins is yeast protein Ecmll, which affects meiotic 
replication and crossing over and is typically modified by sumoylation. Studying such 



The Relationship Between Replication and Recombination 


55 


proteins and discussing the relationship between replication and recombination from 
different points of view may improve understanding of living systems on a higher level. 
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1. introduction 

All living organisms face a constant barrage of DNA damage from anthropogenic and 
naturally occurring external and endogenous sources, yet DNA provides the blueprint for all 
other cellular structures. Unlike these other structural and functional molecules, DNA is not 
turned over in a cycle of breakdown and rebuild; DNA, by the nature of its function as an 
informational macromolecule and its double stranded structure, is faithfully repaired and 
copied to maintain the encoded information. In the case of multicellular organisms, the focus 
of this review, fidelity of information prevents disease, both heritable (e.g. genetic disorders), 
and within an individual (e.g. cancer). However, errors in DNA replication and repair serve as 
the grist of evolution; in some sense, these errors are essential to life as we know it. 

With few exceptions, the repair of DNA damage requires the action of one or more DNA 
polymerases. In many cases, these are specialized polymerases, recruited to the site of damage 
for their specific biochemical properties. In this literature review, I will present different types 
of DNA damage, the biochemical systems utilized in the repair of this damage, the role of 
various enzymes in this repair - emphasizing the role of specific DNA polymerases - and the 
outcome of repair - including the resulting mutation spectrum - where relevant. 

2. DNA polymerases 

DNA polymerases are responsible for the replication of DNA. They perform this function by 
adding free deoxynucleotides to the 3' end of a DNA strand or RNA primer and extending 
the strand in the 5' direction; they are not able to synthesize DNA de novo without this 3' 
hydroxyl (Baker & Bell 1998; Hubscher et al. 2002). They are typically composed of several 
subunits, but an in depth discussion of polymerase components is beyond the scope of this 
chapter; please see Kawasaki and Sugino for a more in depth discussion of polymerase 
subunits (Kawasaki & Sugino 2001). The mechanism of nucleotide insertion by a polymerase 
is a process consisting of 6 steps binding of the DNA template, binding of the incoming 
dNTP, undergoing a change in conformation to become more catalytically effective, 
formation of the phosphodiester bond, release of the pyrophosphate group, and 
translocation to the next template base or dissociation from DNA (Kuchta et al. 1988; Kuchta 
et al. 1987; Patel et al. 1991; Washington et al. 2000; Washington et al. 2001; Wong et al. 1991). 
The structure of a polymerase is similar to that of a right human hand grasping a DNA 
strand, and is thus described as having finger, palm and thumb subdomains (Steitz 1998). 
Synthesis is carried out using the opposite strand as a template—the semi-conservative 
model of DNA synthesis (Meselson & Stahl 1958). 
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2.1 DNA polymerase families 

Polymerases are organized into 7 families by their sequence, structure, and function. These 
polymerase families are: A, B, C, D, X, Y and RT. Each family has different properties and 
roles. For example, A Family polymerases are responsible for the bulk of S-phase DNA 
synthesis, and Y Family polymerases are responsible for Translesion Synthesis (TLS) and the 
bypass of some DNA lesions. A brief discussion of bacterial polymerase families follows, as 
does a more detailed discussion of multicellular eukaryotic polymerase families, 
representative members, and roles (summarized in Table 1). 

A Family polymerases are replicative and repair polymerases that include the E. coli 
replicative polymerase poll, T. aquaticus poll, and the human mitochondrial polymerase, 
polY. This family also includes the E. coli T7 polymerase, one of the most studied 
polymerases. Orthologues of this family include members of the B Family of eukaryotic 
polymerases (Hubscher et al. 2002). The tight active site and 3' primer interactions prevent 
base pair mismatches, and thus these polymerases have a low error rate. pol0 is a low 
fidelity A Family member with roles in repair, possibly including Base Excision Repair 
(BER) and Interstrand Crosslink Repair (ICLR) (Chan et al. 2010; Prasad et al. 2009). 

The B Family of polymerases is closely related to the A Family in structure and function. 
They are DNA directed DNA polymerases and this family includes the eukaryotic 
replicative and repair polymerases pola, pol5, and pole, pola with its accompanying 
primase is responsible for initiation of DNA synthesis (Harrington & Perrino 1995). pol5 and 
pole cooperate to accomplish leading and lagging strand DNA synthesis (Chilkova et al. 
2007). polS and pole are also involved in Homologous Recombination Repair (HRR) of DNA 
(Asturias et al. 2006; Kelman 1997; Maloisel et al. 2008). Another B Family member, polCj, is 
involved in TLS (Haracska et al. 2003). 

C Family polymerases represent the main bacterial chromosomal synthetic polymerases 
(Lamers et al. 2006). They are fast moving, have proofreading capability, and are structurally 
and possibly evolutionarily distinct from the other polymerase families (Bloom et al. 1997; 
Lamers et al. 2006). Another distinct family are the D Family polymerases. They are archaeal 
polymerases that are assumed to function as replicative polymerases (Ishino et al. 1998; 
Yamasaki et al. 2010). 

X Family polymerases are extensively involved in a variety of DNA repair mechanisms. This 
family includes poip, polo, polX, and polp, as well as terminal deoxynucleotidyl transferase 
(TdT). Some of the X Family members, including poip and polX, can cleave a 5' abasic 
deoxyribose sugar, a critical function in BER and possibly required for Non-Homologous 
End Joining (NHEJ) (Fan & Wu 2004; Garcia-Diaz et al. 2001). TdT expression is limited to 
developing leukocyte lineages where where it plays a critical role in V(D)J Recombination, a 
specific type of NHEJ (Mahajan et al. 1999). polX and poip are required for NHEJ (Fan & Wu 
2004; Mahajan et al. 2002). polo works in concert with pole in sister chromatid cohesion and 
HRR (Edwards et al. 2003). 

The Y Family of polymerases, including polr], poll, polx, and REV-1 are involved in TLS. 
Each polymerase in this family has a different bypass preference. For example, polq will 
bypass cyclothymine dimers (CTD), inserting two adenines opposite the lesion in an error 
free manner. Loss of polq because of its involvement in CTD bypass gives a Xeroderma 
pigmentosum variant phenotype as with many Nucleotide Excision Repair (NER) enzymes 
(Johnson et al. 1999b; Washington et al. 2000). poll is not able to replicate past a CTD, 
however, it is involved in the nucleotide incorporation opposite an abasic site and the 3' 
thymine of a (6-4) photoproduct (Johnson et al. 2000). Although it can insert bases opposite 
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the lesion, this polymerase cannot extend the nascent DNA strand. While polx cannot insert 
bases opposite damage as polr|and poll can, it does extend opposite the 3' end of the lesion 
(Washington et al. 2002). REV-1 is thought to play a supporting role as well, triggering 
synthesis of the other Y family members at these lesions (Guo et al. 2003; Ohashi et al. 2004; 
Tissier et al. 2004). The roles of polymerases in TLS are discussed in more detail below. 


Polymerase 

Family 

Polymerase 

Role in DNA 
Replication 

DNA Repair Pathway Participation 

A 

pol0 

N/A 

BER?, ICLR? 

B 

pola 

Initiation of 
Replication 

N/A 

pol5 

Processive 

Synthesis 

HRR 

pole 

Processive 

Synthesis 

Long Range HRR DNA Synthesis, 

SSA 

poie; 

N/A 

TLS, HRR? 

X 

poip 

N/A 

BER 

polA 

N/A 

NHEJ 

polp 

N/A 

NHEJ 

polo 

N/A 

Stimulates pole 

TdT 

N/A 

V(D)J Recombination 

Y 

polq 

N/A 

TLS past CTD 

poll 

N/A 

TLS past abasic and 6-4 photoproduct 

polK 

N/A 

Extension from 3' of bypass by polq and 
poll 

REV-1 

N/A 

Triggering Synthesis by the Other Y 
Family Members 


Table 1. DNA Polymerases Involved in DNA Repair and Their Roles 


The RT Family of polymerases includes RNA-directed DNA polymerases that use RNA 
primers or are involved in viral reverse transcription, like the eukaryotic polymerases 
responsible for telomere maintenance, telomerase (Gotte et al. 1999; Greider & Blackburn 1989). 
RT stands for Reverse Transcriptase, the primary function of members of this polymerase 
family. 

3. DNA repair 

The goal of all DNA repair is to maintain the integrity of the genome with minimal, and 
ideally no changes to the original DNA sequence. In the case of single strand damage, such 
as spontaneous depurination, oxidation, alkylation, and ultraviolet (UV) light 
photoproducts, this is readily achieved by Direct Reversal Repair (DRR), in which an 
enzyme directly returns the lesion to its former, undamaged state, or utilizing the 
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antiparallel DNA strand as template, as in TLS, or following excision of the damage as in 
BER and NER. However, when both strands are damaged, repair mechanisms including 
Double Strand Break Repair (DSBR) and Interstrand Crosslink Repair (ICLR) are utilized, 
and repair becomes increasingly complicated and in many cases mutagenic. 

The type of DNA damage incurred dictates the mechanism(s) of DNA repair. Initial 
recognition of the lesion directly recruits, or signals for the recruitment of repair factors. 
The exact repair mechanism implemented for a specific type of lesion may vary, depending 
upon available factors or cell cycle status. 

3.1 Direct reversal repair 

DRR represents a set of enzymes that catalyze direct repair of the damaged base/s, 
returning the DNA to its previous, undamaged configuration and sequence. Examples of 
DNA damage and their repair proteins would include 0 6 -alkylguanine being repaired by 
0 6 -alkylguanine DNA alkyltransferase (AGT), 1-alkyladenine being repaired by AlkB 
dioxygenase human homologues AHB2 and AHB3, cyclopyrimidine dimers (CPD) being 
repaired by DNA photolyase and 6-4 photoproducts being repaired by 6-4 photolyase 
(Duncan et al. 2002; Kim et al. 1993; Wibley et al. 2000; Zhao, X. et al. 1997). As these specific 
enzyme mechanisms execute a direct catalytic repair of DNA damage, they do not require 
DNA synthesis in repair. However, some of these same forms of DNA damage, for example 
CPD, can be repaired through alternate, DNA polymerase requiring mechanisms, such as 
NER. 

3.2 Base excision repair 

BER is utilized in the repair of DNA damage incurred on a single strand, where there is little 
structural alteration of the DNA backbone. This damage includes apurinic sites resulting 
from spontaneous depurination, oxidized or alkylated bases, or base mismatches resulting 
from 5-methylcytosine deamination (T/G mismatch) or polymerase errors. Because there is 
little backbone distortion, these lesions tend not to block replicative polymerases, and as a 
result, this damage can be highly mutagenic if not detected and repaired. For example, an 
unrepaired 8-oxoguanine at DNA synthesis will lead to G:C to T:A transversions common to 
many solid tumors (Bruner et al. 2000). 

3.2.1 Steps of BER 

In BER, specific glycosylases recognize and bind specific lesions (Banerjee et al. 2006; Bruner 
et al. 2000; Engelward et al. 1997; Klungland et al. 1999; Parsons 2003) with the assistance of 
accessory proteins such as MutM (Banerjee et al. 2006) (Figure 1). Examples include 8- 
oxoguanine DNA glycosylase recognizing and binding 8-oxoguanine (Bruner et al. 2000) or 
alkyladenine DNA glycosylase recognizing and binding alkyladenine (Engelward et al. 
1997). The damaged base is flipped out and cleaved by the glycosylase (Banerjee et al. 2006) 
generating an abasic site. Apurinic-apyrimidinic endonuclease (APE1) will then nick or 
break the DNA backbone (Mol et al. 2000; Srivastava et al. 1998) (Figure 1). 

At this point, two possible pathways. Short Patch Repair (SPR) and Long Patch Repair 
(LPR), diverge (Kubota et al. 1996). In SPR, pol(3 will cleave the 5' abasic sugar and replace 
the missing nucleotide (Garcia-Diaz et al. 2001). X-ray Repair Cross Complementing Protein 
1 (XRCC1), along with its binding partner, DNA Ligase III will seal the backbone nick. In 
LPR, Replication Factor C (RFC) loads Proliferating Cell Nuclear Antigen (PCNA) at the 
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incision (Kelman 1997). PCNA will facilitate pol|3 binding (a common step in the 
recruitment of many polymerases to damaged sites) and polp will synthesize a stretch of 
DNA, creating a 5' flap (Srivastava et al. 1998). This flap is trimmed by Flap Endonuclease 1 
(FEN1), and DNA Ligase I seals the remaining nick or break in the DNA backbone (Prasad 
et al. 2000; Srivastava et al. 1998) (Figure 1). In both cases, the DNA is repaired in an error- 
free manner, if the damage is recognized before S-phase and DNA synthesis. 

BER 


I Damage with Little 
▼ Backbone Disruption 


Recognition by Damage 
, r Specific Clycosylase 


I AP Site Generated, 
Y AP Endonuclease 


Short Patch Repair 
Polp Inserts Single nt 


/ 


Long Patch Repair PCNA/ 
DNA Polp Synthesis, Parental 
Strand Displace ment 


DNA Ligase III/ 
XRCC1 


^ FEN1 Trims Flap 


I DNA Ligase I/ 
I XRCC1 


Fig. 1. The generalized steps of BER 


3.3 Nucleotide excision repair 

Whereas the damage to which BER responds does not significantly distort the DNA 
backbone, NER explicitly recognizes this backbone distortion utilizing two different 
systems—Transcription Coupled Repair (TCR, (Bohr et al. 1985; Mellon et al. 1987)) and 
Global Genome Repair (GGR, (Aboussekhra et al. 1995; Araujo et al. 2000)). The types of 
damage recognized by these systems primarily consist of bulky adducts, including UV 
photoproducts (cyclopyrimidine dimmers and 6-4 photoproducts (Mellon et al. 1987; Ng et 
al. 2003)) or N-acetoxy-2acetyl-aminofluorene induced adducts (Ng et al. 2003). These bulky 
adducts will stall DNA polymerases as well as transcription machinery, and are therefore 
less mutagenic than the types of damage repaired by BER, but they can be cytotoxic 
(Mitchell et al. 2003). 


3.3.1 Steps of NER 

TCR is activated by the stalling of RNA polymerase II upon encountering a bulky adduct. 
This recruits Cockayne Syndrome Proteins A and B (CSA and CSB, (Henning et al. 1995)), 
Xeroderma Pigmentosum Protein A (XPA), Binding Protein 2 (XAB2, (Nakatsu et al. 2000)), 
and High Mobility Group Nucleosome Binding Protein 1 (HMGN1, (Birger et al. 2003)) to 
the site of damage. This system for sensing DNA damage can only function on actively 
transcribed genes. With GGR, the bulky adducts are recognized by the Xeroderma 
Pigmentosum Protein C (XPC)-Homologue of RAD23 B (HR23B) complex (Figure 2), with 
help from Xeroderma Pigmentosum Protein E (XPE, also known as Damaged DNA Binding 
Protein 2 (DDB2)) in the case of photodimers (Kulaksiz et al. 2005; Yokoi et al. 2000). 
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i 

] 
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PCNA/DNA Polft/c Synthesis 
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Fig. 2. Generalized Steps of NER 

Following damage recognition, these two repair systems utilize the same enzymes to 
catalyze repair. These include the basal transcription factor (TFIIH), with the helicases 
Xeroderma Pigmentosum Protein B (XPB, 3' to 5' helicase) and Xeroderma Pigmentosum 
Protein D (XPD, 5' to 3' helicase), as well as Xeroderma Pigmentosum Protein A (XPA) and 
Xeroderma Pigmentosum Group G (XPG)(Tantin 1998; Tantin et al. 1997). Once bound, the 
TFIIH complex opens the lesion. XPA recruits Replication Protein A (RPA), which stabilizes 
the single stranded DNA (ssDNA) (Li et al. 1995a; Li et al. 1995b). The Excision Repair Cross 
Complementation Group 1-Xeroderma Pigmentosum Group F (ERCC1-XPF) heterodimer 
then incises the DNA backbone 5' of the damage, while XPG incises the backbone on the 3' 
side of the adduct (O'Donovan et al. 1994; Sijbers et al. 1996). 

With 25-30nt surrounding the lesion removed, RFC loads PCNA, as in BER (Kelman 1997). 
At this stage, variations in the final steps may be observed with dividing and non-dividing 
cells (Fousteri & Mullenders 2008). In the case of dividing cells, DNA pol5 and pole will 
syntheize DNA across the gap, and DNA Ligase I will seal the resulting nick (Aboussekhra 
et al. 1995; Araujo et al. 2000; Araujo & Wood 1999). In the DNA of non-dividing cells (out of 
S-phase), polx might instead be utilized to bridge the gap, with XRCC1-DNA Ligase III 
sealing the nick (Moser et al. 2007; Ogi & Lehmann 2006). Again, as with BER, the utilization 
of the undamaged strand results in error-free repair. 


3.4 Translesion synthesis 

The DNA lesions produced in cells by a variety of chemical or physical agents 
can sometimes escape the repair mechanisms of the cells, including NER. Because 
these lesions distort the structure of the DNA, the high-fidelity polymerases are no longer 
able to bind and replicate past the lesions. Other polymerases, the low-fidelity, TLS 
polymerases, are recruited at the replication fork and are involved in replication past the 
DNA damage (Burgers et al. 2001; Ohmori et al. 2001). As noted above, the most significant 
classes of TLS enzymes encountered in eukaryotes are the members of the Y-family of DNA 
polymerases (polr), i, k and Revl) also known as UmuC/DinB/Revl/Rad30 superfamily, in 
addition to a member of the B-family of polymerases, pol£j (Ohmori et al. 2001; Zhao, B. et al. 
2004). 
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3.4.1 Steps of TLS 

The first step in DNA repair is the recruitment of TLS polymerases at the stalled replication 
fork by a monoubiquitnated, sumoulated, or otherwise modified PCNA (Haracska et al. 
2001a; Haracska et al. 2001b; Waters et al. 2009). Rev-1 is suspected to act as a trigger in 
mobilizing the other polymerases, polr|, poll, and pok, to insert a base or bases opposite the 
damaged nucleotide/s (Guo et al. 2003; Ohashi et al. 2004; Tissier et al. 2004). After bringing 
the polymerases to the lesion site during the repair process, the same Rev-1 will bind pol^. 
polij is a member of the B-family of polymerases that is not very efficient in inserting the 
nucleotides opposite lesions, but can extend the primer termini (Haracska et al. 2003). Once 
this polymerase binds, it will begin to synthesize a stretch of DNA opposite the damage site, 
completing the bypass of the lesion. Two generalized models for TLS include polymerase 
switching, in which a replicative polymerase stalls, PCNA is modified, and a TLS 
polymerase extends from the lesion, and once bypassed, replication resumes. The other 
model is gap filling, in which a gap is left following replication fork stalling at a damaged 
site, and TLS fills this gap. These models are reviewed in Waters et al. (Waters et al. 2009). 

3.5 Double strand break repair 

Unlike the damage repaired by DRR, BER, and NER, DSB involve both strands of DNA. 
This eliminates the ability of DSBR to utilize DNA's built in backup, the opposing DNA 
strand, as it too is damaged. This damage is typically caused by ionizing radiation, 
neighboring single strand breaks, natural processes such as V(D)J recombination, meiotic 
and mitotic crossing over, yeast mating type switching, and the collapse or stalling of 
replication forks (Khanna & Jackson 2001; Sugawara et al. 2000). 

There are three main pathways for the repair of DSB:— Single Strand Annealing (SSA), 
NHEJ, and HRR. Local differences in DNA sequence, the availability of repair factors, the 
availability of a homologous sequence, and cell cycle status affect which mechanism of 
DSBR is utilized. NHEJ, of which there are two alternate biochemical pathways, the 
primary, Ligase IV dependent NHEJ (D-NHEJ) and the backup, Ligase III dependent NHEJ 
(B-NHEJ), is utilized for most DSB in mammalian cells (Mladenov & Iliakis 2011). SSA can 
occur where there are direct repeats in DNA sequence, and there are lower levels of NHEJ 
components (Fishman-Lobell et al. 1992). HRR functions predominantly in S and G2 phases 
of the cell cycle, when homologous substrates are readily available (Aylon et al. 2004). 

3.5.1 Steps of NHEJ 

DSBR by NHEJ can follow a main D-NHEJ (named for the requirement of DNA-PKcs) 
pathway responsible for the bulk of DSBR in mammalian cells, or a Backup B-NHEJ. Both 
pathways repair DSB in similar manners, but utilize different proteins at each step 
(Mladenov & Iliakis 2011). For simplicity. Figure 3 will only list D-NHEJ, but B-NHEJ 
follows similar catalytic steps, as discussed below. 

In mammalian D-NHEJ, the DSB is recognized and bound by Ku70/Ku80 heterodimer. This 
leads to binding of the DNA Dependent Protein Kinase catalytic subunit (DNA-PKcs) 
yielding a functional DNA-PK holoenzyme (Cary et al. 1997). The ends are modified by 
polynucleotide kinase (PNK) and terminal deoxynucleotide transferase (TDT), also, an as 
yet unidentified endonuclease (possibly Artemis, a substrate of DNA-PK) will then resect 
the 5' ends leaving 3' ssDNA. These ends will be filled by DNA polk and p, and joined by 
the DNA Ligase IV-XRCC4 complex, enhanced by XRCC4 Like Factor (XLF) (Bryans et al. 
1999; Fan & Wu 2004; Mahajan et al. 2002; Yano et al. 2008) (Figure 3). 
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Less is known about B-NHEJ then D-NHEJ, but B-NHEJ seems to act more slowly than D- 
NHEJ, though it can ultimately achieve the same ends, the repair of DSB (Iliakis 2009; Wang, 

H. et al. 2003; Wang, M. et al. 2006; Wu et al. 2008). This alternate pathway was identified in 
cell lines deficient in many of the D-NHEJ proteins (Nevaldine et al. 1997; Wang, H. et al. 
2001a; Wang, H. et al. 2001b). In B-NHEJ, Poly [ADP-ribose] polymerase 1 (PARP-1), usually 
associated with single strand break repair, is responsible for end recognition and binding 
(McKinnon & Caldecott 2007; Wang, M. et al. 2006). A role for the MRN complex in processing 
of the break has been suggested based upon reduced end joining in D-NHEJ deficient cells 
when Mrell is inhibited (Rass et al. 2009). DNA Ligase III and XRCC1 are regulated by PARP- 

I, and a role for Ligase III has been demonstrated in NHEJ (McKinnon & Caldecott 2007; 
Windhofer et al. 2007). Enhancement of Ligase III activity by Histone HI, suggests a role in B- 
NHEJ as well (Rosidi et al. 2008). No direct evidence for the role of a specific polymerase has 
been identified in B-NHEJ, however, it in known that PARP-1 interacts with DNA pola, 
suggesting possible involvement in this pathway (Dantzer et al. 1998). 

nhfi 


Double Strand Break 


Binding, Stabalization, S' Resection 
KU70/80, DNA-PK, MRN 


Pol/, and PoluGap Filling, Ligation 
XRCC4/XLF, DNA Ligase IV 


Fig. 3. Generalized steps of NHEJ 

If the two ends that are joined by NHEJ are from a single double strand break, the outcome 
can be conservative or lead to deletions of varying sizes, depending upon the extent of 
processing of the ends. If, however, the two ends that are rejoined were from separate DSBs, 
the result will be a translocation. While NHEJ is quite proficient at rejoining DSB ends and 
eliminating this highly cytotoxic DNA damage, it does not involve a mechanism to choose 
which ends to rejoin, and is, thus, considered to be a non-conservative mechanism for DNA 
repair. 

3.5.2 Steps of SSA 

SSA requires a more specific set of conditions than NHEJ, specifically neighboring repeats 
either side of the DSB, and will predominantly occur in S-phase (Frankenberg-Schwager et 
al. 2009; Sugawara et al. 2000). At the site of the DSB, there is a 5' to 3' resection, likely by the 
MRN complex, that leaves 3' tails. RAD52 binds the 3' ends, and these tails are stabilized by 
the ssDNA binding protein, RPA (Stasiak et al. 2000; Van Dyck et al. 1999; Wold 1997). These 
factors are sufficient for annealing of the repeat sequences. FEN1 or ERCC1/XPF then trim 
the 3' overhangs (Al-Minawi et al. 2008; Gottlich et al. 1998). The remaining gaps are filled 
and ligated by DNA pole and DNA Ligase III (Gottlich et al. 1998) (Figure 4). 

As neighboring repeats are annealed, sometimes at distances of 40bp to l-2kb apart, SSA 
results in deletions of varying sizes (Gottlich et al. 1998; Richardson & Jasin 2000). With 
multiple genomic DSB, SSA has also been demonstrated to yield translocations (Richardson 
& Jasin 2000). Presumably, these result from homologous sequences on non-homologous 
chromosomes annealing and being joined by the SSA machinery. 
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Fig. 4. Generalized Steps of SSA 


3.5.3 Steps of HRR 

Like SSA, HRR requires a homologous sequence, typically a sister chromatid; thus most 
HRR occurs in late S or G2 phases of the cell cycle. Following DSB formation, there is a 5' 
resection leaving 3' tails. Rad51 recombinase (made up of Rad51B and Rad51C) binds these 
single stranded regions, a homologous region is identified, and a D-loop is formed (Sung & 
Robberson 1995). This complicated process utilizes a number of other proteins, including the 
MRN complex which, along with BRCA1 and CtlP, again serves a role in resection; BRCA2, 
which facilitates Rad51 loading and facilitates recombination; RPA, which acts to stabilize 
the ssDNA and promote strand exchange with Rad51; Rad54, which aids in chromatin 
remodeling; and Hop2-Mndl, which help promote D-loop capture and processing (Chen et 
al. 2008; Pellegrini et al. 2002; Solinger et al. 2001; Stauffer & Chazin 2004; Sung & Robberson 
1995; Vignard et al. 2007). (Other protein systems are responsible for meiotic recombination 
and are not discussed in detail here, but are reviewed in (Smith & Nicolas 1998)). 


HRR 


1 

i 



Double Strand Break 


S' Strand Resection 
by MRN Complex 


Homolgous Strand Invasion/Heteroduplex Formation 
RAD51, RAD54. BRCA1/2. CtlP. Hop2/Mndl 



i 


DNA Poll) Extension 



Heteroduplex Resolution 
(mus81?) 



1 


DNA Ugase I 


Fig. 5. Generalized Steps of HRR 











































72 


DNA Replication and Related Cellular Processes 


DNA polr| will then extend from the 3' end of the invading strand, elongating the D-loop 
(Mcllwraith et al. 2005). The invading strand is then displaced and anneals with the second 
DSB end, being ligated by DNA Ligase I (Goetz et al. 2005) (Figure 5). This mechanism of 
repair avoids the formation of a Holiday junction and the risk of crossover (referred to as 
Synthesis Dependent Strand Annealing or SDSA) (Ferguson & Holloman 1996). However, 
where the goal is crossover, as in meiotic recombination, the extended D-loop will bind the 
second DSB end creating a Holiday junction, leading to crossover and non-crossover 
products (referred to as the DSBR pathway) (Szostak et al. 1983). 

A specialized version of HRR, called Break Induced Replication (BIR), acts to preserve single 
DSB ends such as those at chromosomal termini, or, as we will discuss in ICLR, at stalled 
replication forks. In BIR similar factors are utilized to process the ends, invade a 
homologous or repeat sequence, extend, and resolve the lesion. Of note, the pola primase is 
required for initiation of replication by pol5 (Maloisel et al. 2008). pole is required for long- 
range synthesis, and, unique to BIR, Pol32 is required but does not function in other HRR 
pathways (Asturias et al. 2006; Lydeard et al. 2007). 

HRR is the only truly conservative repair mechanism for double strand breaks, with both 
SDSA and DSBR, when a holologous chromosome provides the template for repair. Unlike 
NHEJ and SSA, it does not typically lead to translocations or deletions, but DSBR can lead to 
crossing over of chromatids. In BIR, if the substrate for recombination is a homologous 
chromatid, repair is conservative; however, if a repeat from a non-homologous sequence is 
utilized, a translocation may result. 

3.6 Interstrand crosslink repair 

There are certain chemicals, endogenous and exogenous, that covalently link both strands of 
a DNA molecule together; these agents include malondialdehyde, mitomycin C, or psorlen 
(Scharer 2005). With an ICL, much like a DSB, the anti-parallel strands can no longer be used 
as template for one another. ICLs covalently link both strands of DNA together preventing 
strand separation required for transcription and replication, making ICL inducing agents 
particularly potent killers of cycling cells (Dronkert & Kanaar 2001). For this reason, many 
ICL inducing agents are used as chemotherapeutic agents in the treatment of cancer (Lawley 
& Phillips 1996; Lord et al. 2002; Ryu et al. 2004). Stalled replication forks will also utilize 
ICLR under certain circumstances (McCabe et al. 2008). 

The genetic disorder Fanconi Anemia (FA) is intimately associated with ICLR, much the 
way Xeroderma Pigmentosum is associated with NER. There are currently 15 FA groups, 
representing defects in 15 proteins involved in ICL repair. These genes are, FANCA, B 
(Fanconi Anemia Associated Protein of 95kDa, FAAP95), C, D1 (BRCA2), D2, E, F, G, I, J 
(BRCA1 Interacting Protein C-terminal Helicase 1, BRIP1), L, M (Fanconi Anemia Associated 
Protein of 250KD, FAAP250), N (Partner and Localizer of BRCA2, PALB2), O (Rad51C), and 
P (SLX4) (Kitao & Takata 2011). The numerous proteins involved in ICLR reflect the 
complexity of repairing this type of damage. The model presented here (Figure 6) is 
speculative; it is based upon published biochemical functions of the proteins involved and 
the formation of recombination intermediates in FA cells between non-homologous 
chromosomes (Newell et al. 2004). The ICLR pathway likely represents a last ditch 
mechanism of repair for this extremely cytotoxic damage where there are no homologous 
substrates for HRR of these lesions (McCabe et al. 2009). TLS is one other option for damage 
bypass, but does not constitute repair. 
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3.6.1 TLS in response to ICL 

If there are incisions on both sides of an ICL on one strand, TLS has been proposed to 
replicate past the lesion, reducing its cytotoxicity and permitting continuation of the cell 
cycle. TLS utilizing error prone pol^ or polr] might replicate past the ICL following DSB 
formation (reviewed in (Dronkert & Kanaar 2001)). However, polr] mutants show normal 
sensitivity to ICL, suggesting no role for this polymerase in repair (Grossmann et al. 2001). 
Little is known about this pathway compared to NER and HR repair; however, it is thought 
that this pathway helps the cells to bypass an ICL to reduce cytotoxicity as opposed to 
actually repairing the lesion (Dronkert & Kanaar 2001). Further, the severity of the FA 
phenotype with respect to ICL damage suggests TLS is, at most, a minor mechanism of ICL 
repair. 

3.6.2 Steps of ICLR 

It has been suggested that the distortion created by the ICL or the ensuing chromatin change 
could be one of the initial signals for repair (Dronkert & Kanaar 2001). DSB signaling, 
including ATM kinase activity, also plays a clear role in ICLR, as DSB are an important 
intermediate in repair, though data suggests that DSB do not activate the FA pathway 
(Rogakou et al. 1999; Sobeck et al. 2007). The collapsed replication fork at the site of ICL 
damage, or in response to hydroxyurea treatment, collapses and regresses with the help of 
RECA (Robu et al. 2001). This leads to loading of the FA core complex via the DNA 
translocase activity of FANCM/MHF complex (Singh et al. 2010; Yan et al. 2010). The FA 
core complex is comprised of FANCA, B, C, E, F, G, L and M, and is required for 
monoubiquitination of FANCD2 by the E3 ligase FANCL, in concert with the E2 subunit 
UBE2T (Machida et al. 2006) (Figure 6). 

Central to the FA pathway are FANCD2 and its paralog FANCI (Smogorzewska et al. 2007). 
FANCD2 monoubiqitination is traditionally looked upon as the marker of activation of the 
FA pathway. Monoubiquitination is required for FANCD2 and FANCI localization to 
chromatin (Garcia-Higuera et al. 2001; Smogorzewska et al. 2007). Monoubiquitinated 
FANCD2/FANCI colocalizes with BRCA1 in response to DNA damage and at 
synaptonemal complexes (Garcia-Higuera et al. 2001). Additionally, FANCD2 has been 
shown to interact in a constitutive manner with FANCD1/BRCA2 and co-localizes with 
RAD51 in nuclear foci (Hussain et al. 2004). FANCD2 also interacts with the MRN complex, 
which may serve a role in processing a recombination intermediate (Nakanishi et al. 2002). 
FANCA, a core complex component, has been shown to interact with several other proteins. 
Independently of the other FA proteins, FANCA interacts directly with BRCA1 without the 
requirement for DNA damage, suggesting a constitutive interaction (Folias et al. 2002). 
Additionally, FANCA has been suggested to aid in the recruitment of the SWI/SNF 
complex subunit, brahma-related gene 1 (BRG1), and may be involved in chromatin 
remodeling at the site of action of the FA pathway (Otsuki et al. 2001). 

Biochemical studies have identified several proteins forming large complexes with the FA 
proteins. Included in the BRAFT complex are five FA proteins (FANCA, C, E, F, and G), the 
Bloom syndrome helicase (BLM), replication protein A (RPA) and topoisomerase Ilia 
(Topo3a). This complex has a DNA duplex unwinding capability that requires BLM, but not 
FANCA. However, BLM is not required for FANCD2 monoubiqitination, suggesting BLM 
functions in this pathway downstream of core complex signaling for activation of the FA 
pathway (Meetei et al. 2003). FANCJ is a BRCA1 interacting protein that functions as an 
ATP- dependent 5'-3' helicase (Cantor et al. 2001; Cantor et al. 2004). Combined with the 
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interaction of FA proteins with BLM, a 3'-5' helicase, these data suggest the ability of FA 
complexes to open stretches of DNA in both directions (Ellis et al. 1995). 

The discovery of FANCD1 as BRCA2 directly linked the FA pathway and HRR pathway 
(Hirsch et al. 2004). BRCA2 is known to regulate RAD51 controlling the formation of the 
RAD51/ssDNA nucleoprotein filament required for strand pairing during HRR in DSBR 
(Davies et al. 2001; Sharan et al. 1997). In addition, BRCA2 binds FANCD2 and G placing the 
core complex and FANCD2 at sites of homologous recombination repair (Hussain et al. 
2004). Another recombination and FA was uncovered with the identification of FANCN as 
the partner and localizer of BRCA2 (PALB2) (Reid et al. 2007; Tischkowitz et al. 2007; Xia et 
al. 2006). As its name suggests, PALB2 interacts with BRCA2 and is responsible for its 
localization to chromatin; thus, PALB2 is required for BRCA2's function in homologous 
recombination repair and cell cycle checkpoints (Xia et al. 2007; Xia et al. 2006). FANCM, in 
addition to its early binding role, also serves a catalytic function in the processing or 
resolution of the recombination intermediate, as cells from a FANCM patient form radials in 
response to ICL inducing agents, but the FANCM defect does not impact FANCD2 
monoubiquitination (Singh et al. 2009). 



Fig. 6. Hypothetical Steps of ICLR 
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All of this suggests the next step in repair involves a HRR or HRR like repair mechanism, 
likely similar to BIR with a non-homologous substrate, as failure to complete repair yields a 
radial formation (Figure 7), and these radials are between non-homologous chromosomes 
(Newell et al. 2004). It would seem the variety of functions pulled together for ICLR serve to 
stabilize the stalled replication fork, initiate recombination in the absence of an available 
direct homologue by identifying repeats or microhomology suitable for recombination on 
non-homologous chromosomes, synthesis along this sequence past the region affected by 
the ICL (possibly involving pol0/Ligase IV, (Chan et al. 2010)), and subsequent resolution of 
the recombination intermediate (Figure 6). 



Fig. 7. A. Reorganization of non-homologous chromosome recombination intermediates 
from the model in Figure 6 into a chromosomal context (large circles represent centromeres) 
demonstrates the possibility for intermediates from this mechanism to yield radials. B. 
Portion of a metaphase spread of ICL treated FA mutant cells and a radial formation. 


4. Discussion 

With the exception of DRR and possibly a small subset of NHEJ, DNA damage repair 
requires DNA replication. BrdU incorporation, a general marker of DNA synthesis, has been 
successfully used as a surrogate marker for DNA damage repair (Kalle et al. 1993; Kao et al. 
2001). The requirement of replication for repair relates to the double stranded structure of 
DNA, and the idea of having a built in backup copy of information on the opposing strand. 
To a certain extent, this holds true even in the case of double stranded damage, though the 
backup may be a repeat sequence on the same stretch of DNA or on a non-homologous 
chromosome, in the case of SSA and ICLR respectively. 

Different types of DNA damage will utilize any of a variety of DNA polymerases, based 
upon the structure/function of this polymerase. The evolution of various DNA polymerase 
families with specific roles supports the importance of replication as an indispensible tool in 
the repair of DNA damage. TLS utilizes polymerases with open active sites to permit 
synthesis past an adduct (Trincao et al. 2001). Long-range synthesis in HRR requires pole, 
because of its processivity (Asturias et al. 2006). BIR requires the normally dispensable pol5 
subunit, pol32, to facilitate replication restart in response to this specific recombination 
based repair mechanism (Lydeard et al. 2007). The importance of polq in NER, is 
demonstrated by the similarity of disease spectrum present in XP variant and the XP group 
genes constituting catalytic functions in the repair of bulky adducts (Johnson et al. 1999a). 
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The evolutionary persistence of many of these polymerases demonstrates that this intimate 
involvement of DNA replication in repair is an indispensible facet of life, as we know it. 
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1. Introduction 

The DNA mismatch repair (MMR) pathway is one of the most important genome 
surveillance systems involved in governing faithful transmission of genetic information 
during DNA replication and homologous recombination (Jiricny 2006). MMR deficiency 
attributes to a phenotype known as microsatellite instability (MSI), a condition that 
predisposes individuals to a heightened risk of cancer development (Harfe & Jinks- 
Robertson 2000; Iyer et al. 2006; Jiricny 2006; Kunkel & Erie 2005). Notably, hereditary 
nonpolyposis colorectal cancer (HNPCC) or Lynch syndrome is the most common 
malignancy identified in individuals with MMR gene mutations. The MMR pathway relies 
on the coordinated functions of a family of proteins that recognize mismatched nucleotides 
and initiate subsequent repair actions ranging from excision of a fragment of DNA 
containing the mismatched nucleotide to DNA repair synthesis. The MMR system is well 
conserved evolutionarily from bacteria to humans, of which the eukaryotic MMR pathway 
is evolved to possess multiple homologous genes that carry out conserved and diverse 
functions corresponding to their bacterial counterparts (Modrich 1991). Genes encoding 
homologues of the bacterial MMR proteins such as MutS and MutL have been identified in a 
variety of eukaryotes including yeast, plants, nematodes, and mammals. However, the 
bacterial MutH appears to be an exception—this methylation-sensitive endonuclease, 
exclusively functioning in gram-negative bacteria, directs the action of MMR to 
hemimethylated newly synthesized bacterial DNA (Iyer et al. 2006). 

All eukaryotic organisms possess multiple MutS homologues — collectively known as MSH 
proteins — which may number as high as seven (MSH1 to MSH7), although the 
mitochondrial-localizing MSH1 of Saccharomyces cerevisiae (Reenan & Kolodner 1992) and 
the Arabadopsis thalia MSH7 (Culligan & Hays 1997) do not appear to be fully conserved 
(Fishel & Wilson 1997). The functionality of these proteins is similar to that of their 
counterparts in bacteria such as Eschericia coli. Prokaryotic MutS exists as a homodimer, 
while eukaryotic MSH proteins form heterodimers in the forms of MSH2-MSH3, MSH2- 
MSH6 and MSH4-MSH5. Two of these eukaryotic heterodimers (MSH2-MSH3 and MSH2- 
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MSH6) play fundamental roles in mitotic post-replicative MMR (Fishel & Wilson 1997; 
Kolodner 1996), in which incorrectly matched bases are replaced with proper partners. 

In spite of the high levels of sequence homology to the other MSH proteins, what role, if 
any, of the MutS homologues MSH4 and MSH5 in the process of MMR has yet to be 
experimentally determined, and the biochemical function(s) of the heterocomplex formed by 
MSH4 and MSH5 also awaits to be further delineated (Her et al. 2007). Nevertheless, high 
levels of expression of both genes in the testes and ovaries implicate a direct role for hMSH4 
and hMSH5 in development and meiosis (Bocker et al. 1999; Her & Doggett 1998; Moens et 
al. 2002). Genetic studies in S. cerevisiae, Caenorhabditis elegans, and Mus musculus highlight 
an important role for MSH4 and MSH5 in meiotic development of viable gametes, but 
provide no evidence to substantiate their potential role in the repair of mismatched bases 
like the other MutS homologues (de Vries et al. 1999; Edelmann et al. 1999; Hollingsworth et 
al. 1995; Kelly et al. 2000; Kneitz et al. 2000; Ross-Macdonald & Roeder 1994; Zalevsky et al. 
1999). The observation that purified recombinant hMSH4-hMSH5 heterocomplex can 
specifically bind to recombination intermediate structures such as the Holliday junction 
(Snowden et al. 2004) has implicated a direct role for hMSH4-hMSH5 in the process of 
meiotic and mitotic recombinational double-strand break (DSB) repair. Gene knockout of 
Msh4 and/or Msh5 in mice results in defective chromosome synapsis in meiotic prophase I, 
and therefore sterility—likely attributed to defective homologous recombination (de Vries et 
al. 1999; Edelmann et al. 1999; Kneitz et al. 2000). Although the expression patterns of MSH4 
and MSH5 mRNA in testis support their functional role in meiosis, low levels of MSH4 and 
MSH5 expression have been identified in many other, non-meiotic organs, and these two 
genes are not necessarily expressed in concert (de Vries et al. 1999; Edelmann et al. 1999; Her 
et al. 2001; Her et al. 2003; Her et al. 1999; Kneitz et al. 2000; Paquis-Flucklinger et al. 1997; 
Winand et al. 1998), implicating MSH4 and/or MSH5 can function in other pathways 
beyond the scope of meiosis and may function separately as well (Her et al. 2007). In fact, 
recent evidence supports the notion that MSH4 and MSH5 are involved in the process of 
mitotic DNA DSB repair, and may be also involved in other aspects of DNA damage repair 
and response (Tompkins et al. 2009; Sekine et al. 2007). 

2. Structural and functional features of MSH4 and MSH5 

2.1 Gene structure and expression of MSH4 and MSH5 

2.1.1 S. cerevisiae MSH4 and MSH5 

The identification of MSH4 and MSH5 was first performed in the budding yeast S. cerevisiae, 
and that led to the classification of these proteins as meiosis-specific members of the MutS 
homologue family. MSH4 was isolated by a transposon insertion screen resulting in the 
generation of lacZ fusion genes expressed specifically in meiotic cells (Burns et al. 1994). 
Following electrophoretic separation of yeast chromosomes, the MSH4 gene was mapped to 
chromosome VI by Southern blot analysis, located 2 cM from SEC4. Sequence analysis of an 
MSH4-complementing subclone identified an open reading frame (ORF) of 2634 bp 
encoding the 878-amino acid MSH4 protein. As a member of the MutS family, MSH4 shows 
35% identity to E. coli MutS and 31-36% identity to the other yeast MutS homologues. The 
similarity is most pronounced in the C-terminal region, which contains the putative ATP- 
binding domain and helix-turn-helix DNA-binding motif. Despite its extensive homology to 
bacterial MutS, which functions in the initiation of MMR through direct binding of 
mispaired bases, MSH4 is meiosis specific. Expression of mutant MSH4 results in defective 
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reciprocal recombination and nondisjunction of homologous chromosomes at meiosis I and 
consequently spore inviability. The MSH4 protein is specifically expressed in meiotic cells 
where it localizes to discrete chromosomal locations with no apparent involvement in MMR 
(Ross-Macdonald & Roeder 1994). 

The MSH5 gene is mapped to chromosome IV, located 1.9 cM from CDC36. The MSH5 ORF 
is composed of 2703 nucleotides and encodes a protein of 901 amino acids with a predicted 
molecular mass of 102 kDa (Hollingsworth et al. 1995). Although MSH5 exhibits strong 
homology to the MutS family of proteins, like MSH4, it is not involved in MMR. Diploids 
lacking MSH5 display decreased spore viability, increased chromosomal nondisjunction 
during meiosis I, with a concomitant decrease in reciprocal exchange between—but not 
within—homologous chromosomes. However, lack of MSH5 expression does not correlate 
to a decrease in gene conversion. Mutants lacking MSH4 or MSH5 are phenotypically 
comparable, indicating that these two yeast genes are in the same epistasis group and are 
likely to serve similar functions (Hollingsworth et al. 1995). 

2.1.2 Mouse MSH4 and MSH5 

Sequence analysis revealed that the mouse Msh4 ORF is 2874 bp in length with a 196 bp 3'- 
UTR followed by a poly(A) tract. A polyadenylation signal (AATAAA) is located 24 
nucleotides 5' upstream from the poly(A) tract. The 958-amino acid protein encoded by this 
ORF contains the highly conserved sequence motifs present in all MutS homologues. The 
cDNA shares 84.1% and 89.3% sequence identities with its human orthologue in nucleotide 
and amino acid sequences, respectively. The amino terminus of Msh4, on the other hand, is 
the most divergent and shares no sequence homology with that of the human hMSH4 (Her 
et al. 2001; Kneitz et al. 2000). Northern blot analysis has indicated that Msh4 is 
predominantly expressed in the testis, but low levels of expression are also present in the 
heart, brain, and liver, whereas dot-blot analysis, besides confirming the Northern results, 
has also revealed low levels of Msh4 expression in several other non-meiotic tissues (Her et 
al. 2001). 

The mouse Msh5 gene is located on chromosome 17 in a region that is syntenic to the locus 
on human chromosome 6 harboring the human gene. The gene for Msh5 contains 24 exons 
and spans approximately 18 kb with exon length varying from 36 bp for exon 7 to 392 bp for 
exon 24; intron lengths range from 79 bp for intron 17 to 4687 bp for intron 11. The first 248 
nucleotides of exon 1 and the last 133 nucleotides of exon 24 are noncoding. Comparison 
between the human and mouse homologues reveals that the mouse Msh5 gene shares a high 
degree of structural homology with that of the human hMSH5 gene. The locations of most 
exon-intron splicing junctions as well as the lengths of all internal coding exons of Msh5 are 
identical to that of HMSH5. The mouse Msh5 ORF is 2502 bp in length and encodes an 833- 
amino acid polypeptide with a predicted molecular weight of 92.6 kDa and an isoelectric 
point of 6.33. Msh5 contains the same set of highly conserved sequence motifs that present 
in all other MutS proteins (de Vries et al. 1999; Edelmann et al. 1999; Her et al. 1999). Like 
Msh4, high levels of Msh5 expression are largely confined to the testis while relatively low 
levels of Msh5 expression are detectable from heart, spleen, liver, and lung (Her et al. 1999). 
Both male and female mice lacking Msh4 or Msh5 are infertile as a result of meiotic arrest (de 
Vries et al. 1999; Edelmann et al. 1999; Kneitz et al. 2000). Spermatocyte chromosomes of 
Msh4 or Msh5 deficient mice do not synapse properly in late zygonema and early 
pachynema despite DSB formation. Chromosome pairing, normally triggered by meiotic 
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DSB, involves mainly nonhomologous chromosomes in Msh4 or Msh5 deficient mice, of 
which only a fraction of nuclei in Ms/r5-null males show partial pairing; while in Ms/t4-null 
spermatocyte pairing is typically higher at 70% of all nuclei. As a consequence, germ cells in 
Msh4 or Msh5 deficient mice fail to enter pachynema and die by apoptosis, leading to testes 
devoid of post-leptotene spermatocytes. In comparison, female mice deficient in Msh4 or 
Msli5 suffer similar consequences as in males experiencing pre-pachytene meiotic 
catastrophe, particularly oocytes become apoptototic prior to birth. In female mice lacking 
both Msh4 and Msh5, the oocyte pool is completely lost and is accompanied by ovarian 
degeneration during the first eight to ten weeks of postnatal period (Kneitz et al. 2000). 

2.1.3 Human hMSH4 and hMSH5 

The human HMSH4 gene is composed of 20 exons and spans 116 Kb on chromosome lp31. 
With a 2808 bp ORF, the HMSH4 gene encodes a protein of 936 amino acids with a predicted 
molecular mass of 104.8 kDa (Paquis-Flucklinger et al. 1997). In contrast, the HMSH5 gene 
contains 26 exons and spans approximately 25 Kb within the MHC class III region on 
human chromosome 6p21.3. The hMSH5 gene harbors a 2501 bp ORF encoding an 834- 
amino acid protein with a predicted molecular mass of 92.9 kDa (Her & Doggett 1998; 
Winand et al. 1998). Moderate levels of hMSH4 transcripts are present in the testis and low 
levels of hMSH4 transcripts are also detectable in several non-meiotic tissues including the 
thymus, ovary, colon, pancreas, brain, heart, liver, and placenta (Her et al. 2003; Paquis- 
Flucklinger et al. 1997). In contrast, the full-length hMSH5 transcripts are detectable in 
virtually all tissues examined with the most abundant expression in the testis. Northern blot 
analysis shows the presence of distinct hMSH5 hybridization signals in a variety of tissues 
such as the thymus, skeletal muscle, bone marrow, spinal cord, brain, trachea, ovary, and 
lymph node (Bocker et al. 1999; Her and Doggett 1998; Winand et al. 1998). This wide 
hMSH5 tissue distribution pattern has been also recapitulated in RT-PCR analysis of various 
human tissues and is well reflected in the human EST database. The difference in the 
expression patterns of these two genes raises the possibility that hMSH4 and hMSH5 may 
function independently of one another in certain tissue types beyond the meiotic arena (Her 
et al. 2007). 

2.1.4 Splicing variants and nonsynonymous polymorphisms of hMSH4 and hMSH5 in 
humans 

Recent evidence substantiates the existence of multiple alternatively spliced transcripts for 
hMSH4 and hMSH5 in human cells (Table 1). Although the functional significance of these 
alternatively spliced transcripts is presently unknown, the expression profiles and some 
properties of the proteins encoded by the splicing variants are being characterized. For 
instance, one hMSH4 splicing variant, hMSH4sv, resulting from exon 19-skipping, has been 
analyzed. hMSH4sv harbors a frameshift of 7 amino acids followed by a stop codon in exon 
20, thus producing an 850-amino acid polypeptide. Truncated at the carboxyl terminal, the 
hMSH4sv contains most of the conserved sequence motifs presented in all MutS 
homologues except for the carboxyl terminal helix-turn-helix motif (Her et al. 2003). The 
tissue distribution profile of hMSH4sv is similar but not identical to that of hMSH4, 
suggesting that hMSH4sv might be subjected to differential regulation in some tissue types 
including the heart, liver, placenta, and ovary (Her et al. 2003). Intriguingly, the protein 
encoded by hMSH4sv is incapable of interacting with hMSH5, but it does interact with von 
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Hippel-Lindau tumor suppressor-binding protein 1(VBP1) (Her et al. 2003). Also of note is 
another hMSH4 exon-skipping variant (AhMSH4) that lacks the entire exon 6 leading to an 
in-frame deletion of hMSH4 amino acid residues 273 to 330 (Her et al. 2003; Santucci- 
Darmanin et al. 1999). The hMSH4 homo dimerization domain, residing in between amino 
acid residues 148 and 387 (Her et al. 2003), significantly overlaps with the region encoded by 
hMSH4 exon 6, thus AhMSH4 is expected to be defective in homotypic interaction. 

The HMSH5 gene produces multiple alternative transcripts, of which four hMSH5 variants 
that maintain the reading frame have been identified (hMSH5a, hMSH5b, hMSH5c, and 
hMSH5d; UniGene database) and hMSH5c appears to be identical to that of the originally 
described human hMSH5. Referenced by the deduced amino acid sequence of hMSH5, it is 
evident that hMSH5a (hMSH5sv) encodes an 851-amino acid protein containing a 17-amino 
acid insertion between codons 179 and 180, owing to the retention of the last 51-bp of 
hMSH5 intron 6 (Yi et al. 2005), whereas hMSH5b harbors one extra amino acid residue 
between codons 654 and 655 — due to the retention of the last 3 nucleotides of HMSH5 intron 
20 (Her & Doggett 1998). hMSH5d represents the shortest hMSH5 variant. Although it 
contains the 17-amino acid insertion and the one extra amino acid residue described above, 
hMSH5d lacks 30 amino acid residues corresponding to codons 744 to 773. The existence of 
different hMSH5 variant transcripts keenly supports the possibility that hMSH5 may encode 
multiple products; a thorough understanding of the functional aspects of these potential 
protein isoforms requires detailed experimental examination of these variants. In particular, 
it is necessary to determine whether these hMSH5 variants are resulted from sequence 
variations within the corresponding introns, or they are created by a yet-to-be-defined 
mechanism. Presently, besides hMSH5, only one splicing variant, hMSH5sv, has been 
experimentally analyzed (Yi et al. 2005). The expression profile of hMSH5sv appears to be 
distinguishable from that of hMSH5; for example, the expression of hMSH5sv, but not 
hMSH5, is absent or below the detection limit in the brain, heart, and skeletal muscle. In 
addition, the expression of hMSH5sv displays a large variation in tumor cell lines with 
breast and lung carcinomas showing the most abundant expression. In contrast to 
hMSH4sv, displaying impaired interaction with hMSH5, hMSH5sv has maintained its 
capacity to interact with hMSH4 (Yi et al. 2005). 

In addition to the existence of multiple alternatively spliced transcripts, potential diverse 
functions involved with the HMSH4 and HMSH5 genes are also being reflected by the fact 
that both genes are associated with many coding region single nucleotide polymorphisms 
(SNPs), of which many are non-synonymous. There are at least seven non-synonymous 
SNPs that have been identified for each of the genes (Table 1). For hMSH4 and hMSH5, the 
corresponding single amino acid changes caused by these SNPs are hMSH4 A60V, A90T, 
A97T, E162K, I365V, Y589C, S914N, and hMSH5 P29S, L85F, Y202C, V206F, R351G, L377F, 
P786S. However, the allele frequencies, haplotypes, and functional implications of most, if 
not all, SNPs are largely undetermined; in fact only one non-synonymous SNP (rs2075789), 
hMSH5 C85T (hMSH5 p29S ), has been characterized experimentally as a common genetic 
polymorphism with an allele frequency of 11.6% in an American Caucasian population of 99 
individuals and 17% in a Chinese population of 279 individuals (Her et al. 2007; Yi et al. 
2005). Located within the hMSH5 amino terminal proline-rich interacting domain for 
hMSH4 and c-Abl, the Pro 29 to Ser alteration causes a moderate reduction of protein 
interaction with hMSH4, whereas this alteration promotes the activation of c-Abl kinase 
activity and therefore enhances ionizing radiation (IR) induced p73-dependent apoptosis 
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Variants and 
nonsynonymous 
polymorphisms 

Changes in amino acid (aa) residues 

hMSH4 

hMSH4sv 

Exon 19 skipping leading to the 
production of a 850 aa polypeptide, of 
which the last 7 aa are frame-shifted 

AhMSH4 

Exon 6 skipping leading to the deletion of 
aa 273 to 330 

A60V 

Ala 60 to Val 

A90T 

Ala 90 to Thr 

A97T 

Ala 97 to Thr 

E162K 

Glu 162 to Lys 

I365V 

H e 365 to Val 

Y589C 

Tyr 589 to Cys 

S914N 

Ser 914 to Asn 

hMSH5 

hMSH5a (hMSH5sv) 

17 aa insertion between aa position 179 
and 180 

hMSH5b 

1 aa insertion between aa position 654 and 
655 

hMSH5d 

17 aa insertion between aa position 179 
and 180 

1 aa inertion between aa position 654 and 
655 and deletion of aa 744 to 773 

P29S 

Pro 29 to Ser 

L85F 

Leu 85 to Phe 

Y202C 

Tyr 202 to Cys 

V206F 

Val 206 to Phe 

R351G 

Arg 351 to Gly 

L377F 

Leu 377 to Phe 

P786S 

Pro 786 to Ser 


Table 1. hMSH4 and hMSH5 splicing variants and nonsynonymous polymorphisms. 


(Tompkins et al. 2009; Yi et al. 2006; Yi et al. 2005). Given the essential role of Msh5 in ovarian 
and testicular development in mice (de Vries et al. 1999; Edelmann et al. 1999), it is 
interesting to note that the hMSH5 C85T SNP is relatively enriched in ovarian cancer 
patients and is associated with a higher risk for azoospermia or severe oligozoospermia in 
humans (Xu et al. 2010; Yi et al. 2005). Another noteworthy hMSH5 variant is 
hMSH5 L85F / p786S —encoded by an hMSH5 allele harboring two co-segregating SNPs (C253T 
and C2356T) — that displays compromised ability to interact with hMSH4. Intriguingly, the 
allele for hMSH5 L85F / p786S has been associated with the occurrence of human 
immunoglobulin deficiency syndromes, i.e. IgA deficiency (IgAD) and common variable 
immune deficiency (CVID) (Sekine et al. 2007). 

It is also important to note that two hMSH5 noncoding SNPs have been linked to conditions 
in humans as well. A recent genome-wide association study (GWAS) of 511,919 SNPs in 
populations with Caucasian origin has identified a high risk factor for lung cancer 
development within the hMSH5 gene locus at 6p21.33 (Wang et al. 2008). Specifically, a 
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significant increase in lung cancer susceptibility is associated with rs3131379, a SNP located 
within intron 10 of the HMSH5 gene. Another study, designed to identify genetic markers for 
the adverse reaction associated with the use of Allopurinol — a common medication for gout 
and hyperuricemia, has revealed a tight link with the HMSH5 locus (Hung et al. 2005). The 
evidence from this study demonstrates a significant association of a separate hMSH5 SNP 
(rsll50793) with the risk of developing severe cutaneous adverse reactions (SCAR) in Han 
Chinese patients treated with Allopurinol. 

Although the existence of multiple hMSH4 and hMSH5 splicing variants and various SNPs 
poses a daunting task for a thorough appreciation of their functions, close analysis of their 
properties at molecular and cellular levels, especially for those with clinical significance, 
would be necessary for delineating the mechanistic basis underlying their potential link to 
disease conditions in humans. It is highly plausible that functional effects similar to those 
observed for hMSH5 P29S could also be conferred by other hMSH5 and/or hMSH4 non- 
synonymous SNPs. It should not be a surprise that different combinations of these SNPs 
might associate with an array of subtle functional alterations; that, to a certain extent, could 
also affect the dynamic interplay among hMSH4-hMSH5 associated proteins and 
subsequent downstream events. 

2.2 Structural properties of MSH4 and MSH5 proteins 

2.2.1 General molecular structure of MutS homologues 

A common characteristic of MutS homologous proteins is their essential role in binding and 
recognizing mismatched base pairs —a function facilitated by their ability to act as DNA- 
binding ATPases. Whilst structures of eukaryotic MSH proteins remains undetermined, 
insights towards the mechanistic aspects of mismatch recognition may be inferred from the 
crystal structure of bacterial MutS protein. Crystallographic studies of homologues in T. 
aquaticus and E. coli suggest that the MutS homodimer binds to heteroduplex DNA during 
mismatch recognition as a "structural heterodimer" (Junop et al. 2001; Laniers et al. 2000; 
Obmolova et al. 2000). The homodimerization of MutS protein is mediated through the 
region harboring MutS ATPase activity. The dimerization domain is far apart from the DNA 
binding domain, but these regions coordinate through conformational changes triggered by 
MutS binding to heteroduplex DNA or ATP (Lamers et al. 2004; Lamers et al. 2003). In 
essence, these studies indicate that MutS is a modular protein with separate domains which, 
when dimerized at their carboxyl termini, act to encircle mismatch-containing DNA during 
the initiation stages of repair process. Deletion analysis of MutS protein confirms that the 
carboxyl terminus, which also includes a P-loop motif for nucleoside triphosphate binding, 
is involved in homodimerization. The amino terminal end is necessary for binding to 
mismatch-containing DNA, and through the binding and hydrolysis of ATP in the carboxyl 
terminal, MutS may dissociate from mismatched DNA once repair is underway. Although 
no crystallographic analysis of eukaryotic MSH proteins is presently available, noting the 
high degree of homology between MutS homologues and their bacterial counterparts, it is 
not difficult to conjecture that the MSH proteins found in yeast, mouse, and humans may 
contain similar structural features to those of prokaryotic MutS proteins. 

2.2.2 Eukaryotic MSH4-MSH5 complex 

Whereas functional bacterial MutS protein exists as homodimers, the eukaryotic MSH 
family is far more complex and contains multiple heterodimers composed of different MSH 
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proteins. MSH heterodimers carry out diverse cellular functions including MMR and DNA 
damage response. However, MSH4 and MSH5 are unique in their intracellular roles. 
Although they contain the conserved sequence motifs found in all MutS homologues —such 
as the ATP binding domain and a helix-turn-helix structural motif located at the carboxyl 
terminal half of the protein (Bums et al. 1994; Her & Doggett 1998; Her et al. 2001; Her et al. 
1999; Hollingsworth et al. 1995; Paquis-Flucklinger et al. 1997) —neither MSH4 nor MSH5 
interact with the other MSH proteins known to function in MMR. MSH4 and MSH5 interact 
with each other exclusively, forming a distinctive heterocomplex (Bocker et al. 1999; Her et 
al. 2001; Her et al. 1999; Winand et al. 1998; Yi et al. 2005). Unlike other MSH proteins, which 
contain specific amino acid residues for recognition and binding of mismatched base pairs, 
MSH4 and MSH5, either individually or as a heterocomplex, are unable to detect or respond 
to mismatches as they lack structural motifs for binding of mismatched base pairs 
(Obmolova et al. 2000). It is postulated that the MSH4-MSH5 heterocomplex may configure 
in a way that can accommodate large recombination intermediate DNA structures such as 
Holliday junction intermediates (Obmolova et al. 2000). Evidence obtained with the purified 
recombinant hMSH4-hMSH5 heterocomplex points to their role in the recognition and 
binding of artificial DNA structures resembling four-way junctions (Snowden et al. 2004), 
suggesting that the hMSH4-hMSH5 heterocomplex is uniquely equipped for processing 
recombinational intermediates. 

The human hMSH4 and hMSH5 interact with one another via the carboxyl terminal region 
of hMSH4 and both the amino and carboxyl terminal regions of hMSH5. The first 109 and 
the last 103 amino acid residues of hMSH5 are necessary for the formation of a composite 
hMSH4-interacting domain; however, only the carboxyl terminal 93 amino acid residues of 
hMSH4 are required to interact with hMSH5 (Yi et al. 2005), suggesting an asymmetric 
structural partition of hMSH4 and hMSH5 in the heterocomplex. The hMSH4-hMSH5 
heterocomplex is suggested to form a sliding clamp structure that stabilizes and preserves 
Holliday junctions during prophase of meiosis I or in the repair of DSBs. A model for 
hMSH4-hMSH5 in meiotic recombination has been proposed (Snowden et al. 2004), 
implicating their role in linking DSB repair to the regulation of crossover (CO) formation. It 
is known that the amino terminal region of hMSH4, composed of amino acid residues 148- 
387, is involved in mediating homotypic interaction (Her et al. 1999; Lee et al. 2006). Due to 
the physical separation of the hetero- and homo-interacting domains on hMSH4, it has been 
suggested that hMSH4 and hMSH5 may form a multimeric protein complex such as a 
tetramer. In addition, it has been demonstrated that the interface of hMSH4-hMSH5 
heterocomplex forms a composite interaction domain for GPS2; the latter is a protein factor 
involved in intracellular signaling and DNA damage response (Jin et al. 1997; Lee et al. 2006; 
Peng et al. 2001; Spain et al. 1996). The interplay of GPS2 with the hMSH4-hMSH5 
heterocomplex may provide a link to downstream molecular events required for Holliday 
junction processing and subsequent resolution. 

2.2.3 Protein interacting partners of MSH4 and MSH5 

The MSH4-MSH5 heterocomplex is thought to participate in a limited array of functions, 
leaving the individual proteins to coordinate specific cellular processes independent of one 
another throughout various mammalian tissues. It has been shown that hMSH4 physically 
interacts with hMLHl as well as its binding partner hMLH3, in which the amino terminal 
region of hMSH4 interacts with hMLHl proteolytic degradation products, rather than the 
full-length hMLHl (Lipkin et al. 2000; Santucci-Darmanin et al. 2002; Santucci-Darmanin et 
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al. 2000). Furthermore, hMSH4 interacts with VBP1, Rad51, and DMC1 (Her et al. 2003; Her 
et al. 2007; Neyton et al. 2004). In addition to its interacting partner hMSH4, hMSH5 has been 
shown to interact with the non-receptor tyrosine kinase c-Abl, hMREll, and histone 
demethylase SMCY (Akimoto et al. 2008; Kato et al. 2007; Yi et al. 2006), of which the 
interaction with c-Abl mediates hMSH5 tyrosine phosphorylation in response to IR-induced 
DSBs (Yi et al. 2006). Coherent with this observation, the ubiquitous DNA damage repair 
protein hRad51 coexists in the protein complex containing both hMSH5 and c-Abl (Her et al. 
2007). 

Interaction with different protein partners may provide a foundation for hMSH4 and 
hMSH5 to act independently in specific cellular processes, during which protein interactions 
can also modulate the functions of hMSH4 and hMSH5. For example, the interaction of 
VBP1 with hMSH4 negatively regulates the formation of hMSH4-hMSH5 heterocomplex 
(Her et al. 2003). The biological relevance of this observation in mitotic cells is presently not 
known. It is reported that, during early stages of mouse testis development, the up- 
regulation of GPS2 coincides with the down-regulation of VBP1 immediately prior to, or at 
the onset of, the first meiotic wave, presumably facilitating the formation of Msh4-Msh5 
heterocomplex (Lee et al. 2006). The hMSH4-hMSH5 interaction is also subjected to 
regulation by c-Abl-mediated hMSH5 tyrosine phosphorylation. In particular, hMSH5 is 
shown to undergo IR-induced c-Abl-dependent tyrosine phosphorylation, and consequently 
this posttranslational modification leads to the dissociation of hMSH4-hMSH5 
heterocomplex (Tompkins et al. 2009; Yi et al. 2006). Since the formation of hMSH4-hMSH5 
heterocomplex is required for the interaction with GPS2 (likely in a complex with HDAC3), 
it is expected that hMSH5 tyrosine phosphorylation will result in a dynamic transformation 
of the hMSH4-hMSH5 associated protein complex, which might be functionally required 
during recombinational DSB repair. 

Recent evidence suggests that factors influence the hMSH4-hMSH5 interaction will also 
affect their subcellular localization. It appears that hMSH4-hMSH5 dimerization enhances 
their nuclear localization—possibly facilitated by a nuclear localization signal (NLS) located 
in the middle of hMSH5 protein, or by masking of a CRMl-dependent nuclear export signal 
(NES) on the carboxyl terminal region of hMSH5 within the hMSH4-interacting domain 
(Lahaye et al. 2010; Neyton et al. 2007). 

2.3 Functions of MSH4 and MSH5 
2.3.1 Role in meiotic recombination 

Meiotic recombination occurs in meiotic prophase starting with the formation of 
programmed DSBs induced by the expression of SPOll, a protein highly conserved and 
enriched in germ cells at recombination hotspots (Baudat & de Massy 2007; Keeney et al. 
1997). After 5' end resection at the break, the emerging 3' single-stranded overhang invades 
undamaged homologous chromosome or sister chromatid, leading to the formation of 
meiotic recombination intermediate structures including the Holliday junction and thus 
facilitating DSB repair (Szostak et al. 1983). The completion of meiotic homologous 
recombination is achieved through the resolution of Holliday junction structures by 
cleavage and rejoining to re-form two separate DNA molecules. The two major outcomes of 
meiotic homologous recombinational DSB repair are chromosomal reciprocal exchange or 
CO and gene conversion or non-crossover (Mahadevaiah et al. 2001). While non-crossover is 
important for conserving genetic identities, chromosomal CO is crucial for creating genetic 
diversity and therefore promoting survival along the way of evolution. Furthermore, COs 
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are critical for successful meiosis, of which both the frequency and the distribution of COs 
are well controlled in a way that governs the fidelity of chromosome segregation (Youds & 
Boulton 2011). 

Studies carried out in budding yeast have provided evidence to indicate that both MSH4 
and MSH5 are involved in meiotic recombination but not in MMR (Hollingsworth et al. 
1995; Ross-Macdonald & Roeder 1994). MSH4-null mutation in S. cerevisiae is not associated 
with elevated rate in mismatches of reporter genes Can R (Reenan & Kolodner 1992) and 
Thr+ (Kramer et al. 1989) or defective gene conversion, instead the MSH4-null strain 
displays 2 to 3-fold reduction in CO and an increase in homologous chromosome 
nondisjunction, leading to reduced spore viability (Ross-Macdonald & Roeder 1994). Similar 
effects are evident in S. cerevisiae with mutant MSH5, in which MMR is proficient in the 
processes of both meiosis and mitosis, and meiotic gene conversion frequency is not 
significantly different from the wild type. However, CO is reduced 2 to 3-fold and 
homologous chromosome nondisjunction is elevated in meiosis I. As a consequence, spore 
viability is reduced to 37% from 72% observed in the wild type (Hollingsworth et al. 1995). S. 
cerevisiae strain lacking both MSH4 and MSH5 does not show a synergistic effect on CO 
frequency in meiosis I, and the spore viability is compatible with either mutants, indicating 
MSH4 and MSH5 act in the same pathway that facilitate meiotic crossing over 
recombination (Hollingsworth et al. 1995). Decreased crossover interference and delayed or 
incomplete chromosome synapsis are also observed in MSH4 mutant yeast, suggesting a 
role for MSH4 in the regulation of crossover distribution (Novak et al. 2001). 

Like budding yeast, C. elegans MSH4 and MSH5 are functionally conserved with respect to 
their role in promoting meiotic CO. With a null mutation in him-14, the C. elegans ortholog of 
MSH4, CO is abolished during meiosis I in both oocytes and spermatocytes. As a result, 
formation of chiasmata is absent and chromosome segregation is severely defective. Most of 
the him-14 embryos, although produced in normal amounts, fail to hatch; and and among the 
hatched that reached to adulthood, 45% are males (Zalevsky et al. 1999; Zetka & Rose 1995). 
This is consistent with an increase in non-disjunction, because males are normally arisen 
through spontaneous non-disjunction with a frequency of about 0.1% in the hermaphrodite 
germ line. The him-14 mutants, however, do not show any abnormality in meiotic 
chromosome pairing and synapsis (Zalevsky et al. 1999). Similar to their budding yeast 
counterparts, the mutant worms do not display increased "mutator" phenotype, assayed by 
the means of levamisole resistance, indicating that Him-14 is not required for MMR in C. 
elegans (Zalevsky et al. 1999). In spite of high degree of conservation, C. elegans and S. cerevisiae 
MSH4 mutants differ significantly in their effects on CO. Without MSH4, CO is completely 
abrogated in C. elegans (Zalevsky et al. 1999), yet about 30-50% is retained in the budding yeast 
(Ross-Macdonald & Roeder 1994). This difference has helped to fashion the current view that 
C. elegans has a single or a dominant pathway to create meiotic CO for which MSH4 is 
indispensible, and budding yeast has alternative MSH4-independent pathway(s) for CO 
generation. While MSH4 is crucial for proper chromosome segregation, in contrast to yeast, C. 
elegans Him-14 does not appear to act on chromosome pairing and synapsis. 

Likewise germ cells from C. elegans msh-5 mutants are able to progress through meiosis with 
normal chromosome pairing and synapsis. Msh-5 mutants lay eggs at a same rate as the 
wild type, and they produce a normal number of embryos, however, more than 97% of the 
embryos fail to hatch and 42% of those survived to adulthood are males, owing to defective 
segregation of homologous chromosomes (Kelly et al. 2000). Similar to the him-14 mutants, 
CO frequency in Msh-5 mutants is reduced to about 1 % of the wild type, and these mutants 
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lack chiasmata formation (Kelly et al. 2000). On the basis of their nearly identical effect on 
meiotic recombination, it is conceivable that both MSH4 and MSH5 are indispensible for C. 
elegans meiotic CO, and these two MutS homologues act in the same process through 
apparently the only pathway in this species. 

In mice, meiotic recombination is also initiated by Spoil-dependent DSBs (Mahadevaiah et 
al. 2001). Msh5 knockout mice are generally healthy and undergo normal development, 
however their reproductive organs do not develop properly. For example, testes in Msh5- 
null animals are significantly smaller in size and ovaries quickly become rudimentary after 
birth (de Vries et al. 1999; Edelmann et al. 1999). Although their mating behavior appears 
normal, Ms/75-null mice are sterile. Histological analysis demonstrates that the testes of 
Mste-null mice are completely devoid of epididymal spermatozoa, presumably due to 
apoptosis before pachytene — the stage of meiosis when crossover structures between 
homologous chromosomes become experimentally visible (de Vries et al. 1999; Edelmann et 
al. 1999). Closer examination of Ms/i5-null spermatocytes and oocytes reveals disrupted 
chromosome pairing and impaired synaptonemal complex formation, indicating Msh5 is 
essential for homologous chromosome pairing and synapsis during meiosis I in mice (de 
Vries et al. 1999; Edelmann et al. 1999). Since the single strand DNA binding protein Rad51 
typically represents a marker for recombination initiation (Bishop 1994; Moens et al. 2002), 
the observed Rad51 foci formation on unsynapsed chromosomes in Ms/75-null mice tends to 
suggest that Msh5 is not involved in the initiation of meiotic recombination (Edelmann et al. 
1999). 

Similarly, MsM-null mice develop normally but are infertile. These mutant mice display 
severe abnormality in chromosome pairing in early prophase I (Kneitz et al. 2000), indicating 
Msh4 is also required for meiotic homologous chromosome pairing. However, Msh4 
deficiency appears to cause less severe defect in chromosome pairing in comparison to 
Ms/75-null animals. Approximately 69% of nuclei in MsM-null spermatocytes have some 
degree of chromosome pairing, but less than 10% of the nuclei contain very few paired 
chromosomes in Ms/75-null mice (Kneitz et al. 2000). Meiotic phenotypes of Msh4 and Msli5 
double-knockout mice are similar to the Ms/75-null mice, suggesting that Msh4 and Msh5 act 
together in promoting meiotic chromosome pairing and synapsis in the early phase of 
meiotic recombination, possibly with Msh5 upstream of Msh4 (Kneitz et al. 2000). 

In human oocytes, both hMSH4 and hMSH5 proteins are localized on meiotic chromosomes 
throughout meiotic prophase I and become co-localized at zygonema with SYCP3, a key 
component of synaptonemal complex, thus supporting the idea that human hMSH4 and 
hMSH5 may possess meiotic properties similar to their mouse counterparts (Lenzi et al. 
2005). This view is further supported by the physical interaction observed between hMSH4 
and hRad51 as well as the co-localization of their counterparts in mouse spermatocytes 
(Neyton et al. 2004) — indicative of an early role for MSH4, presumably for MSH5 as well, in 
meiotic recombination. The presence of hMSH4 and hMSH5 foci on meiotic chromosomes 
after synaptonemal complex formation is highly suggestive of an additional role for these 
proteins in the late stages of meiotic recombination (Lenzi et al. 2005). Consistent with this 
speculation is the observed in vitro physical interaction between hMSH4 and hMLHl, the 
later is commonly regarded as a marker for crossing over recombination (Lynn et al. 2004). 
Together with the co-localization of these two proteins in mouse spermatocytes (Santucci- 
Darmanin et al. 2000), it is suggested that MSH4 and MSH5 play at least two separate roles 
in meiosis I — an early role in homology searching that leads to proper chromosome pairing 
and a late role in the processing of recombination intermediate structures. Binding of 
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purified hMSH4-hMSH5 heterocomplex to the core of the Holliday junction intermediate 
structures has provided in vitro evidence to support the late role of these two proteins. It is 
hypothesized that the binding of progenitor Holliday junction DNA provokes ATP binding 
to both hMSH4 and hMSH5 in the heterocomplex, and the loading of hMSH4-hMSH5 is 
projected to stabilize and preserve meiotic recombination intermediates prior to proper 
resolution (Snowden et al. 2004). 

In short, both MSH4 and MSH5 play important roles in meiotic recombination. The 
similarity in phenotypes among MSiT4-null, MSH5-null, and MSft4-null/MSiT5-null 
organisms suggests that these two MutS proteins act in the same pathways and likely 
function as a heterocomplex at least during certain phases of the recombination process. 
Although MSH4 and MSH5 homologues in lower eukaryotes and mammals appear to share 
conserved properties supporting their common function in meiosis, their precise effects 
diverge in different species. For instance, the major action of MSH4 and MSH5 in lower 
eukaryotes is on the generation and control of COs, whereas in mammals these two MutS 
homologues have evolved to exert an early role in governing homologous chromosome 
pairing and synapsis. 

2.3.2 Role in mitotic recombination and DNA damage response 

The most described function of MMR proteins is the repair of base-pairing errors arising 
from DNA replication or recombination. Defects in MMR proteins lead to mutations and 
MSI. Mutations in MMR genes are known to cause Lynch syndrome (or HNPCC) and to 
increase the risk and progression of a wide-variety of sporadic cancers. During MMR, 
replication errors are recognized either by MutSa, a heterodimer of MSH2 and MSH6 that 
binds to single base-base mismatches or small insertion-deletion loops, or by MutSp, a 
heterodimer of MSH2 and MSH3 that binds to larger insertion-deletion loops (Jiricny 2006). 
The repair process then proceeds with the recruitment of a MutL activity ( i.e. one of these 
three complexes: MLH1-PMS2, MLH1-PMS1 or MLH1-MLH3) and exonucleases to facilitate 
subsequent excision followed by DNA repair synthesis. 

The MMR proteins are frequently recognized as major players in mediating cellular 
responses to DNA damage. As components of the BRCA-1-associated genome surveillance 
complex (Wang et al. 2000), MMR proteins are thought to mediate DNA damage response 
through either direct sensing of various DNA distortions or functioning as transducers to 
couple damage detection and the control of cell cycle progression (Jiricny 2006). For 
instance, MMR proteins are involved in provoking a G2/M phase cell cycle arrest that is 
vital for both DNA repair and cell death by apoptosis, especially for cells carrying excessive 
DNA lesions. To this end, it is important to note that MMR deficiency has been linked to 
DNA damage tolerance, and MMR deficient cells are found to be frequently resistant to 
killing by various chemotherapeutics including alkylating agents and antimetabolites. Given 
that resistance to chemotherapeutic agents is of a great concern in cancer treatment, a 
thorough understanding of the molecular mechanisms involved with MMR in cellular DNA 
damage response will undoubtedly pave a way for devising more effective therapeutic 
strategies. 

Although MSH4 and MSH5 have not been indicated experimentally to function in the 
process of MMR, cumulating evidence has pointed to their potential involvement in mitotic 
DNA repair and damage response in addition to their role in meiosis. The discrepant 
expression patterns of MSH4 and MSH5 in mammalian tissues have suggested that these 
two proteins may function independently of one another and may operate beyond meiosis 
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(Her et al. 2007; Yi et al. 2005). Studies in yeast have raised the possibility that MSH5 may be 
involved in cellular response to DNA damage induced by alkylating agents. Specifically, a 
mutant MSH5 allele ( i.e. msh5-14, encoding MSH5 Y823H ) but not MSH5-null mutant in the 
yeast strain XS-14 (lacking 0 6 -methylguanine methyltransferase) confers cellular tolerance 
to alkylating compound N-methyl-N'-nitro-N-nitrosoguanidine (MNNG) (Bawa and Xiao 
1997; 2003). Although the exact mechanism of action remains to be explored, these studies 
do indicate that this gain-of-function MSH5 mutant is involved in mediating mitotic cellular 
response to DNA damage. 

It is evident that many of the proteins involved in meiotic recombination are also important 
players in mitotic recombinational DSB repair. Thus, the involvement of MSH5 in meiotic 
recombination is suggestive of a potential role for MSH5 in mitotic recombinational DSB 
repair. DSBs are considered to be the most lethal form of DNA lesion that may arise from 
replication fork collapse, exposure to DNA damaging agents, or programmed cellular 
processes including the initiation of meiotic recombination in germ cells, class-switch 
recombination (CSR), and V(D)J recombination in lymphocytes (Ataian & Krebs 2006). In 
response to DSB formation, dividing cells usually undergo G2/M phase arrest, then either 
repair the lesion and resume cell cycle or enter the path of apoptosis depending on the 
extent of damage. Cells lacking proper G2/M cell cycle arrest, apoptosis, or damage repair 
are often at a higher risk for malignant transformation. The repair of DSBs requires either 
the non-homologous end joining (NHEJ) or the homologous recombination pathways. 
While NHEJ is a rapid means utilized by somatic cells to repair DSBs, it is error-prone and 
can result in alteration of DNA sequences for non-compatible breaks (Dery & Masson 2007; 
Helleday et al. 2007). On the other hand, homologous recombination is a more accurate 
repair pathway during S/G2 phases as it utilizes the homologous template usually provided 
by a sister chromatid or homologous chromosome (Saleh-Gohari & Helleday 2004). Similar 
to meiotic homologous recombination, this homology-directed DSB repair will also produce 
two alternative outcomes, CO or non-crossover/gene conversion. Loss of essential 
homologous recombination gene products often results in chromosome instability, by which 
cells exhibit increased sensitivities to a variety of DNA damage agents such as IR, cisplatin, 
and gemcitabine. In addition, homologous recombination deficient cells may also display 
DNA damage tolerance and resistance to killing by DNA damaging agents as well (Crul et 
al. 2003; Khanna & Jackson 2001; Takata et al. 2001; van Waardenburg et al. 2004; Zdraveski 
et al. 2000). 

As supported by the observed interactions with hRad51, c-Abl, and hMREll as well as 
structures resembling Holliday junctions (Her et al. 2007; Kato et al. 2007; Neyton et al. 2004; 
Snowden et al. 2004; Yi et al. 2006), hMSH4 and hMSH5 are expected to play functional roles 
in mitotic DNA damage response. In C. elegans oocytes, silencing of RAD51 on a MSH5 
deficient background results in chromosome fragmentation, while a comparable defect is 
also noted when MSH4 and BRCA1 homologues are concomitantly silenced, indicating the 
existence of functional interplay of these proteins in the maintenance of chromosome 
integrity (Adamo et al. 2008; Rinaldo et al. 2002). The interaction between hMSH5 and c-Abl 
can provoke two different cellular actions depending on the severity of DNA damage 
(Tompkins et al. 2009). Endogenous hMSH5 protein undergoes IR dose and time dependent 
induction, and it appears that increased levels of hMSH5 can promote IR-triggered 
apoptosis. However, this effect is more prominent in cells treated with a relatively high dose 
of IR (> 2 Gy). This is reminiscent of a previous observation that the linear correlation 
between DSB repair and the number of yH2Ax foci only exists in cells irradiated with IR at 
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doses below 2 Gy (Bouquet et al. 2006), suggesting the existence of a dynamic regulatory 
mechanism controlling DSB repair and apoptosis. Present evidence is compatible with the 
view that the expression of hMSH5 is normally maintained at a low level in unperturbed cells, 
and DNA damage-triggered hMSH5 induction promotes c-Abl activation and subsequent 
initiation of a p73-mediated caspase 3-dependent apoptotic response (Tompkins et al. 2009). 
Coherently, the peculiar interaction between hMSHS 1,298 and c-Abl is capable of over-activating 
c-Abl, leading to increased cellular radiosensitivity. In contrast, moderate hMSH5 induction is 
expected to facilitate recombinational DSB repair. Recent experimental results have 
demonstrated a functional requirement for c-Abl-mediated hMSH5 phosphorylation in DSB 
repair (Her et al. 2007; Tompkins et al. 2009). It is demonstrated, by the use of a chromosomally 
integrated recombination reporter, that hMSH5 tyrosine phosphorylation is an essential early 
event for non-crossover DSB repair. In fact, cells harboring a phosphorylation deficient hMSH5 
mutant are more sensitive to DSB-inducing anticancer drug cisplatin. Collectively, these 
studies tend to suggest a dual role for the induction of hMSH5 protein in recombinational DSB 
repair and DNA damage-triggered apoptotic response. 

2.3.3 Role in immunoglobulin diversification 

Genomic rearrangement that occurs during B-cell development in the form of V(D)J 
recombination is essential for the generation of antibody repertoire, and additional diversity 
is achieved by elevated mutation rate and gene sequence shuffling in corresponding 
processes known as somatic hypermutation (SHM) and CSR (Li et al. 2004b; Schroeder & 
Cavacini 2010). While SHM targets the "hot spots" in variable (V) region of immunoglobulin 
(Ig) to enhance antigen recognition, CSR is responsible for Ig isotype switching by way of 
recombining the switch (S) regions upstream of each functional genes (Ch) in the constant 
(C) region of Ig heavy (H) chain, i.e. switching from IgM or IgD to other isotypes that 
mediate diverse effector functions in the daughter cells (Li et al. 2004b; Schroeder & Cavacini 
2010). Both SHM and CSR start from deamination of dC to dU by activation-induced 
cytidine deaminase, a centroblast B-cell specific protein (Muramatsu et al. 2000; Muramatsu 
et al. 1999). Deamination of dCs leads to G-U mismatches and may subsequently generate C 
to T or G to A mutations. In addition, uracil N-glycosylase, apurinic/apyrimidinic 
endonucleases, and MMR proteins are required to generate extended single base mutations 
and single-stranded DNA nicks (Di Noia & Neuberger 2002; Ehrenstein & Neuberger 1999; 
Ehrenstein et al. 2001; Guikema et al. 2007; Imai et al. 2003; Martin & Scharff 2002; Rada et al. 
2002; Schrader et al. 1999). During CSR, DSBs have to be generated in the S regions, possibly 
by way of single-strand breaks, to allow intrachromosomal deletion via recombination 
between the two S regions (Wuerffel et al. 1997), in which DSBs are primarily repaired by 
NHEJ. The tandem repeats of short consensus sequences in the S regions are too short for 
homologous recombination, however they are enough for microhomology-mediated end 
joining (MMEJ), a specified type of NHEJ using small homologous regions to anneal the two 
overhangs at the break site. It is generally accepted that MMEJ represents a prominent 
process of NHEJ in CSR (Renter 1999), and the length of microhomology at S-S junctions 
may change due to impaired MMEJ (Stavnezer 2000). 

It is well documented that proteins involved in MMR and DSB repair are also involved in the 
process of CSR. In fact, Msh2, Msh6, Mlhl, and Pms2 knockout mice all exhibit impaired CSR 
and SHM (Ehrenstein & Neuberger 1999; Martin & Scharff 2002; Schrader et al. 1999). Mice 
deficient for early DSB response factors ATM or H2AX, and NHEJ proteins Ku70, Ku80, or 
DNA-PKcs, show reduced CSR activities but with normal SHM (Bemark et al. 2000; Casellas et 



The Role of MutS Homologues MSH4 and MSH5 in DNA Metabolism and Damage Response 


101 


al. 1998; Manis et al. 2002; Manis et al. 1998; Reina-San-Martin et al. 2004; Reina-San-Martin et al. 
2003). MutS and MutL homologues are not only required to assist MMEJ, but they also play 
overlapping and distinct roles in the process of switch recombination (Ehrenstein et al. 2001; Li 
et al. 2004a; Schrader et al. 1999; Schrader et al. 2002). This assessment is mainly obtained 
through analyzing microhomology at the S-S junctions as well as the distribution of 
breakpoints in the absence of each protein (Ehrenstein et al. 2001; Li et al. 2004a; Schrader et al. 
1999; Wu et al. 2006). Collectively, deficiency of MutS and MutL homologues has been 
associated with three different phenotypes at the S-S junctions— Msh2 or Mlh3 deficiency leads 
to a decrease in the length of microhomology (Schrader et al. 2002; Wu et al. 2006), Ms/ 76 -null B 
cells show no change of microhomology (Li et al. 2004a), and an increase in the length of 
microhomology is evident in Mlhl- or Pms2-null B cells (Ehrenstein et al. 2001; Schrader et al. 
2002). MLH1-PMS2 acts downstream of MSH2-MSH6 during MMR, similar effect would be 
anticipated if the process of MMR played a predominant role. The presence of different 
microhomology phenotypes indicates that these MMR proteins are involved, at least in part, in 
different sub-pathways during MMEJ. Indeed, PMS2 has been speculated to interact and 
stabilize MMEJ intermediates, whereas MSH2 might participate in DNA end processing 
(Ehrenstein et al. 2001; Schrader et al. 1999; Schrader et al. 2002). 

Present evidence suggests that MSH4 and MSH5 are additional players in CSR. These two 
MutS homologues are known to function in the early and late steps of meiotic 
recombination as well as in mitotic DNA damage response — events that share some 
similarities to crucial steps in the complex CSR process. Transcripts of HMSH5 are present in 
human spleen and peripheral blood B cells (Her & Doggett 1998; Sekine et al. 2007). 
Association studies have linked one of the non-synonymous alleles of HMSH5 to IgAD and 
CVID, two syndromes that are attributable to abnormal CSR. The allele encoding the 
hMSH5 L85F / p786S variant is significantly more frequent in IgAD patients and with borderline 
significance in CVID patients. One HMSH5 SNP (rs3131378), located within intron 12, is also 
tightly associated with IgAD and CVID. Furthermore, CVID patients carrying the 
hMSH5 LS5V / p7S6S allele display increased microhomology at Sp-Sal joints, which commonly 
associate with far fewer mutations (Sekine et al. 2007). The L85F and P786S alterations, 
located within the hMSH4-interacting domain, significantly compromise the protein 
interaction between hMSH5 and hMSH4, suggesting both hMSH5 and hMSH4 are required 
for efficient CSR (Sekine et al. 2007). 

In mice, however, the effects of Msh5 on CSR seem to be strain-specific. In an earlier study, 
most of the congenic H-2 b / b MRL /Ipr mice (introgression of H-2 b MHC haplotype 
froml29/Sv onto a MRL /Ipr background) show undetectable serum IgG3, reduced levels of 
serum IgA and elevated levels of serum IgM and IgG2a in older mice, suggesting impaired 
CSR in these animals. Microarray analysis of gene expression reveals a hypomorphic allele 
of Msh5 on the H-2 b haplotype. Comparing to controls, H-2 b / b MRL /Ipr mice without serum 
IgG3 (IgG3 ne s) exhibit significantly increased microhomology at Sp-Sy3 ( i.e. IgM to IgG3 
switch) and Sp-Sa (i.e. IgM to IgA switch) joints in splenic B cells (Sekine et al. 2007). This 
phenotype of increased microhomology is also observed in B cells from Ms/ 75 -null FVB mice 
(de Vries et al. 1999) and Msh4 knockout mice on C57BL/6 background (Sekine et al. 2007). 
These studies indicate that both MSH5 and MSH4 are involved in CSR and may function as 
a heterodimer. Interestingly, studies performed with Msh5-mill C57BL/6 mice show no 
detectable alterations in the length of microhomology (Guikema et al. 2008). It is conceivable 
that the different effects of Msh5 deficiency on CSR in these two mouse strains might be 
attributable to different levels of Msh5 expression, of which MRL/Ipr is a high Msh5 
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expresser, whereas C57BL/6 expresses very low levels of Msh5, and the difference between 
them is about 100-fold (Sekine et al. 2007; Sekine et al. 2009). It is also important to note that 
these two Ms/t5-null mouse lines have been reported to display different degrees of meiotic 
chromosome pairing defects (de Vries et al. 1999; Edelmann et al. 1999), suggesting that the 
role of Msh5 might be influenced by its associated genetic backgrounds. 

3. Conclusion 

The MMR system has received a considerable amount of attention over the last decade. It is 
now clear that the MMR system is multifaceted and participates in several different 
pathways of DNA metabolism. Among all MMR components, the MutS family of proteins 
plays critical and conserved functions during the initiation phase of mismatch recognition. 
In contrast to those MutS homologues involved in MMR, the functions of MSH4 and MSH5 
are still not fully understood. These two MutS homologues share similar structure and 
sequence features with the other members of the MutS family. Present evidence suggests 
that MSH4 and MSH5 have evolved to function in recombinational DSB repair, DNA 
damage signaling, and immunoglobulin class switch recombination. Although the 
mechanistic details of their involvement in these processes have yet to be elucidated, it is 
conceivable that the actions of hMSH5 in recombinational DSB repair is both hMSH4- 
independent and hMSH4-dependent, presumably coordinating with their functions in 
homology searching and the resolution of recombination intermediate structures. Available 
evidence supports a scenario that the expression of hMSH5 is maintained at a low level 
under normal conditions, whereas DNA damage-elicited hMSH5 induction can promote c- 
Abl activation and the initiation of a p73-mediated caspase 3-dependent apoptotic response. 
On the contrary, moderate hMSH5 induction, caused by less severe DNA damage, is 
expected to facilitate recombinational DSB repair. It is plausible that hMSH4 and hMSH5 
may also play a role in CSR, in particular, through manipulating the process of MMEJ. 

The current and emerging evidence has lent support to the idea that MSH4 and MSH5 are 
involved in diverse functions by engaging different pathways through various interactions 
with different proteins. Undoubtedly, the current knowledge about these two MutS 
homologues has created a solid steppingstone for future exploration of their biological 
functions and potential association with disease conditions in humans. 
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1. Introduction 

Within each viral particle, retroviruses package two copies of a single-stranded RNA 
genome of about 10 kb. All of the viral genomes contain three major genes, arranged in the 
order: 5'-gag-pol-env-3', and some retroviruses may also have accessory genes ( e.g. vif vpr, 
tax, etc...). Structural proteins such as MA (matrix protein), CA (capsid protein) and NC 
(nucleocapsid protein) are encoded within gag. Envelope proteins that mediate viral entry 
(surface and transmembrane glycoproteins) derive from expression of the env gene. Virus- 
encoded enzymes such as the protease, the reverse transcriptase (RT) and the integrase, 
required to complete the viral life cycle, usually derive from the expression of pol. The 
reverse transcription of the viral single-stranded (+) RNA genome into double-stranded 
DNA is an essential step in retroviral replication and an important target for therapeutic 
intervention (for reviews, see Telesnitsky & Goff, 1997; Abbink & Berkhout, 2008; Sarafianos 
et al., 2009). Reverse transcription is a relatively complex process that requires the 
intervention of at least three elements: (i) the viral genomic RNA (that serves as template); 
(ii) a specific primer ( i.e. a transfer RNA); and (iii) the viral RT. Retroviral RTs are enzymes 
that possess two activities: (i) a DNA polymerase activity that uses either RNA or DNA as 
template, and (ii) an RNase H activity, which degrades RNA from RNA/DNA hybrids. 
Unlike eukaryotic DNA polymerases, retroviral RTs are devoid of 3'-> 5' exonucleolytic 
proofreading and show intrinsic error frequencies of around Id 4 to Id 5 , well above the 
values reported for cellular DNA polymerases. Their lower accuracy together with their 
ability to switch templates during reverse transcription are major contributors to the 
extensive genetic variability observed in many retroviruses including human immuno¬ 
deficiency virus type 1 and type 2 (HIV-1 and HIV-2). The diversification of retroviral 
genomes, based on the sequence of the pol gene (encoding for viral enzymes including RT) is 
illustrated in Fig. 1. Environmental factors as well as the molecular structure of retroviral 
RTs modulate their fidelity. In addition, retrovirus genetic variability can be affected by 
viral and cellular proteins. In this review, studies dealing with the molecular basis of fidelity 
of HIV-1 RT are summarized and discussed in the light of crystal structures of the enzyme. 
Structural information has been most useful in the design of antiretroviral drugs targeting 
the DNA polymerase activity of the RT. The last sections of this chapter summarize current 
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knowledge on the molecular basis of antiretroviral drug resistance and the mechanisms 
leading to selection of drug-resistant HIV. 



■ Rodents ■ Reptiles • Insects 

Fig. 1. Dendrogram representing phylogenetic relationships between retroviruses. The 
unrooted neighbour joining dendrogram is based on pol sequences of seven retroviral 
genera (alpha-, beta-, gamma-, delta-, epsilon-, lenti- and spuma-like retroviruses). 
Reproduced from Jern et al. (2005); originally published by BioMed Central. 

2. The process of reverse transcription 

An overview of the reverse transcription process is shown in Fig. 2. Reverse transcription 
initiates after binding of a cellular tRNA primer (in HIV-1, tRNA L y s < 3 ), to the primer binding 
site (PBS). The PBS is a sequence of 18 nucleotides, located downstream of the 5'-end of the 
genomic RNA. The viral RNA that serves as template for reverse transcription is flanked by 
repeat (R) sequences at its 5' and 3' termini. Upon annealing, the tRNA primer is extended 
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up to the 5' end of the genome, generating an intermediate which is known as the minus- 
strand strong-stop DNA ((-)ssDNA). Simultaneously, the RNase H activity of the RT 
degrades the RNA strand annealed to (-)ssDNA, and the (-)ssDNA is released to hybridize 
with the R sequence located at the 3'-end of the same RNA (intra-molecular jump) or with 
an R sequence located at a different viral genome (inter-molecular jump). This step is 
referred to as the first strand-transfer. 


(-) strand synthesis 

(-) strand transfer 


RNA-dependent DNA 
polymerase+RNase H 


(+)strand synthesis 


Removal of RNA 
primers 


(+) strand transfer 


DNA-dependent DNA 
polymerase and strand 
displacement 


Double stranded viral DNA 



Viral genomic RNA 
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U3 R U5 
(-)strand DNA 
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Fig. 2. Overview of the reverse transcription process. 


After binding of the (-)ssDNA molecule to the R sequence, synthesis of the minus-strand 
DNA continues along the viral RNA together with simultaneous degradation of the 
template RNA. Polypurine tracts at the U3 region (3'-PPT) and in some retroviruses, at the 
centre of the genomic RNA (cPPT) resist degradation. These sequences serve as primers for 
synthesis of plus-strand DNA. Plus-strand DNA synthesis originating from the 3'-PPT 
continues to the 5'-end of the minus-strand until it reaches the 18 th nucleotide in the tRNA 
where further synthesis is blocked by a methylated base. Priming from both PPTs involves 
discontinuous DNA synthesis that generates a 99-nucleotide DNA "flap" in the centre of the 
molecule. This product is referred to as plus strong-stop DNA ((+)ssDNA). The second 
strand transfer reaction involves the annealing of the (+)ssDNA to the 3'-end of the full- 
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length (-)strand DNA, through base pairing of the complementary PBS and PBS' sequences. 
This strand transfer reaction is usually intra-molecular (Yu et al., 1998), and renders a 
circular intermediate. 

The strand displacement activity of the RT is required to complete DNA synthesis and for 
the generation of a proviral DNA with duplicated long terminal repeats (LTR) at both ends. 
In the case of HIV-1 and other lentiviruses a "flap" appears in the centre of the DNA. This 
structure is eliminated by the activities of a cellular endonuclease that removes the "flap" 
and a ligase that joins the DNA ends to render the complete provirus that can eventually 
integrate in the host cell genome. 

2.1 Role of the NC protein in reverse transcription 

In most retroviruses, NC is a small nucleic-acid binding protein that derives from cleavage 
of the Gag polyprotein. In HIV-1, it has 55 amino acid residues and contains two conserved 
zinc fingers with the sequence CX 2 CX 4 HX 4 C (CCHC) connected by a short basic peptide 
linker. NC functions mainly as a nucleic acid chaperone (Levin et al., 2010), but it also 
contributes to packaging and dimerization of genomic RNAs, virus assembly, integration 
and reverse transcription. 

NC influences almost every step in the reverse transcription process: (i) it facilitates 
annealing of the tRNA L ys3 to the viral genomic RNA; (ii) basic residues of NC bring the 
complementary tRNA and PBS sequences together, while its zinc fingers destabilize 
secondary structures in the 5' LTR to prevent pausing of RT during reverse transcription; 
(iii) NC enhances minus strand transfer, by accelerating the annealing of 5' repeat cDNA to 
the 3' repeat sequence of the viral RNA genome; and (iv) NC increases RNase H activity, 
thereby enhancing cleavage of the donor template and promoting strand transfer by 
creating free cDNA available for interaction with the acceptor (Thomas & Gorelick, 2008). 
The ability of NC to destabilize secondary structures alleviates pausing and enhances RT 
processivity, leading to the generation of a greater proportion of full-length DNA products. 
NC is also important for removal of non-PPT RNAs, since it inhibits elongation of these 
primers without affecting extension of genuine polypurine tracts (Jacob & DeStefano, 2008). 

2.2 Host and viral factors controlling reverse transcription 

Interactions between RT and other viral proteins and/or cellular factors are likely to occur 
during early and late phases of viral replication. The formation of the reverse transcription 
complex involves packaging of tRNA in virions. This is possible because during assembly. 
Gag and Gag-Pol precursors interact with the aminoacyl-tRNA L Y s - 3 synthetase (LysRS) 
which has bound tRNA L Y s ' 3 (reviewed in Abbink & Berkhout, 2008). RNA helicase A is a 
cellular protein that is able to rearrange RNA structures due to its unwinding activity on 
RNA secondary structures. RNA helicase A activity promotes viral reverse transcription by 
facilitating the accessibility of the RT to the viral RNA (Roy et al., 2006). Other cellular 
proteins such as HuR, AKAP149 and DNA topoisomerase I have been shown to interact 
with retroviral RTs (for a review, see Warren et al., 2009). 

In addition, several factors have shown an influence on the integrity of the viral genome. 
Thus, in HIV-1, the accessory protein Vif (viral infectivity factor) promotes reverse 
transcription while increasing viral infectivity (Carr et al., 2008). In the absence of NC, Vif 
promotes annealing of tRNA L y s A decreases pausing of the RT, destabilizes nucleic acid 
secondary structures, stimulates ssDNA synthesis and increases the efficiency of the first 
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strand transfer event during reverse transcription. Moreover, Vif is also an RNA chaperone. 
In the presence of NC, Vif inhibits NC-induced tRNA L Y s - 3 annealing, RNA dimerization and 
reverse transcription initiation. Taken together, those results suggest that Vif could prevent 
premature initiation of reverse transcription (Henriet et ah, 2007). 

Through the expression of Vif, HIV-1 counteracts the antiviral effect of apolipoprotein B 
mRNA-editing, catalytic polypeptide enzymes (APOBEC3). APOBEC3 family members are 
cellular proteins with cytidine deaminase activity that have anti-HIV-1 activity (Sheehy et 
al., 2002; reviewed in Aguiar & Peterlin, 2008). APOBEC3F and APOBEC3G are 
encapsidated into budding virions. In the absence of Vif, APOBEC3F/G induce 
hypermutation of the HIV-1 genome. In vitro studies have also shown that in the presence of 
NC, APOBEC3G affects tRNA L y s - 3 annealing (Guo et al., 2007), and reduces strand transfer 
and integration (Mbisa et al., 2007). 

2.3 RNase H activity and dynamics of RT/nucleic acid interactions 

As mentioned earlier, retroviral RTs have an endonuclease activity that hydrolyzes the RNA 
strand in RNA/DNA hybrids to generate 5'-phosphate and 3'-hydroxyl ends. Retroviral 
RNase H has two major distinct modes of activity: (i) polymerase-dependent (DNA 3'-end- 
directed cleavage), and (ii) polymerase-independent (RNA 5'-end-directed cleavage). In 
addition, internal cleavage of RNA on RNA/DNA hybrids can also occur in the absence of 
5' or 3' ends. The efficiency of this cleavage depends on the specific nucleotide sequence in 
the vicinity of the cleavage site (for a review, see Schultz & Champoux, 2008) (Fig. 3). 
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Fig. 3. Retroviral RT binding modes to RNA/DNA hybrids and effects on RNase H 
cleavage. The polymerase-dependent mode is characterized by the polymerase active site 
being in contact with the 3' DNA primer terminus. The polymerase-independent mode 
depends on the position of 5'-end of RNA fragments relative to the polymerase active site. 
Nucleotides 15-20 and 13-19, numbered from the 3'-end of DNA and the 5'-end of the RNA, 
respectively, are those susceptible to RNase H cleavage in the proposed models. 
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In the polymerase-dependent mode, the polymerase active site of the RT locates at the 3'- 
end of DNA primer, while the RNase H domain is positioned 15-20 nucleotides away from 
the DNA primer terminus. In this situation, the RNase H makes primary cuts and then 
slides forward to make the secondary cuts 5-8 nucleotides from the 3'-end of DNA primer. 
These cleavages can occur during DNA synthesis and in the absence of dNTPs. Factors that 
decrease the rate of DNA polymerization result in an increased number of RNase H cuts, 
due to RT pausing. The distance from the recessed DNA 3' and the nucleotide sequence in 
the vicinity of the cleavage site are important factors that contribute to the frequency of 
DNA 3'-end-directed cleavages (Basu et al, 2008; Schultz & Champoux, 2008). 
Polymerase-independent cleavages depend on the precise position of the 5'-end of RNA 
fragments relative to the polymerase active site. In this mode, the polymerase domain of the 
RT binds the DNA strand in a site that is near the RNA 5'-end. The RNase H domain is 
positioned 13-19 nucleotides away from the 5'-end of the RNA. The distance from the 
recessed end, the nucleotide sequence in the vicinity of the cleavage site and the accessibility 
of the 5'-end (a gap of 2 or more bases is sufficient for such recognition) are important 
factors influencing RNA 5'-end-directed cleavages (Schultz & Champoux, 2008; Champoux 
& Schultz, 2009; Herschhorn & Hizi, 2010). 

During reverse transcription, the polymerase-dependent mode is used during minus-strand 
DNA synthesis to cleave the RNA strand of the hybrid. However, the rate of DNA 
polymerization is about 10 times faster than the RNase H cleavage rate and therefore, 
insufficient for total RNA degradation and DNA release. Polymerase-independent RNase H 
activity is required to complete the process. Cleavages to generate the PPT primer are 
sequence-specific and can occur internally on RNA/DNA hybrids, without DNA synthesis. 
Polymerization-independent RNase H activity also appears to be important for removal of 
extended tRNA and PPT primers. 

PPT primer removal could occur by an internal cleavage event at the RNA-DNA junction, or 
alternatively by an RNA 5'-end-directed cleavage after extension (Schultz & Champoux, 
2008; Champoux & Schultz, 2009). However, molecular details underlying these events are 
largely unknown. Chemical probing, NMR spectroscopy and single-molecule fluorescence 
resonance energy transfer (FRET) have been recently used to explore molecular aspects 
involved in the communication between RT and the RNA/DNA hybrid (Liu et al., 2008; 
Abbondanzieri et al., 2008; reviewed in Fabris et al., 2009; Gotte et al., 2010). These studies 
revealed that an important factor determining enzyme binding orientation is the backbone 
composition of nucleotides at the 5'-end of the primer. These nucleotides make specific 
contacts with RNase H primer grip residues, and thereby regulate the DNA polymerase 
activity of the RT (Abbondanzieri et al., 2008). These studies also showed that during 
reverse transcription, the RT could alternate binding modes (i.e. polymerase-dependent or 
independent) in the presence of PPT. The switching kinetics can be regulated. Thus, in the 
presence of dNTP, the polymerization binding mode predominates and flipping between 
both orientations decreases (Liu et al., 2008; reviewed in Gotte et al., 2010). 

2.4 Recombination and strand transfer 

Recombination is the major source of genetic variability in retrovirus. Recombination can 
mediate the repair of defective retroviral genomes, increase viral diversity and accelerate the 
spread of beneficial mutations. In HIV-1, recombinant genetic forms derived from virus of 
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different subtypes emerged in individuals infected with viruses of two or more different 
subtypes. Inter-subtype recombination allowed the generation of around 50 different well- 
characterized circulating recombinant forms (CRFs) (for an update, see 
http://www.hiv.lanl.gov/content/sequence/HIV/CRFs/CRFs.html). 

Two relevant properties of retroviral reverse transcription contribute to the high frequency 
of recombination. First, retroviral genomic RNAs are encapsidated in pairs. Second, the 
replication machinery is prone to recombination, since template switching is required to 
complete proviral DNA synthesis (Yu et al, 1998; Ramirez et al., 2008; Onafuwa-Nuga & 
Telesnitsky, 2009). Recent studies have shown that the HIV-1 RT can stabilize short (2- 
nucleotide) duplexes of 3 L overhangs of the primer strand that are annealed to 
complementary dinucleotide tails of DNA or RNA template strands. This RT "clamping" 
activity that anneals RNA and DNA strands could have a role in strand transfer (Oz- 
Gleenberg et al, 2011). 

3. The crystal structures of retroviral RTs 

Although all retroviral RTs share similar activities and derive from expression of the viral 
pol gene, they differ in size and subunit composition (Herschhorn & Hizi, 2010). Lentiviral 
RTs are asymmetric heterodimeric enzymes, as shown for HIV-1 and equine infectious 
anemia virus (with subunits of 66 and 51 kDa), feline immunodeficiency virus RTs (67 and 
54 kDa) and HIV-2 RT (68 and 55 kDa). Viral RTs arise from processing of the Gag-Pol 
precursor and contain DNA polymerase and RNase H domains in the large subunit and the 
DNA polymerase domain alone in the smaller subunit. The function of the small subunit is 
mainly structural. The bovine immunodeficiency virus RT has been expressed in bacteria as 
a 64/51-kDa heterodimer. A similar structure is shared by the human T cell lymphotropic 
virus type I RT that contains subunits of 62 and 49 kDa. The RTs of alpharetroviruses ( e.g. 
avian sarcoma leukaemia virus) are larger heterodimers composed of subunits of 94 and 62 
kDa, where the 94-kDa subunit also contains the viral integrase protein (Hizi & Joklik, 1977). 
On the other hand, the RT of the murine leukaemia virus (MLV) (a gammaretrovirus) is a 
monomer of 75 kDa, containing both DNA polymerase and RNase H domains. Less-studied 
recombinant RTs from mouse mammary tumour virus (66 kDa), bovine leukaemia virus (64- 
80 kDa) and prototype foamy virus (80 kDa) appear to be active as monomers or 
homodimers, although their subunit composition in the virion is unknown. 

Our current knowledge on the structure of retroviral RTs is essentially based on many 
crystal structures of HIV-1 RT alone or in complex with nucleic acid or inhibitors, as well as 
additional information on a few crystal structures of HIV-2 RT and MLV RT. These studies 
have revealed the structure of the two catalytic domains of the RTs (DNA polymerase and 
RNase H), which are separated by a connection subdomain. As found in other DNA 
polymerases, RTs have a structure that resembles a right hand (Fig. 4), with three 
subdomains in the DNA polymerase domain, designated as fingers, palm and thumb. 

3.1 HIV-1 RT 

HIV-1 RT is a heterodimer composed of two subunits known as p66 (560 amino acids) and 
p51 (440 amino acids). The DNA polymerase active site residues (AspllO, Aspl85 and 
Aspl86) are located in the palm subdomain of p66 (Fig. 4). In p66, the palm and connection 
subdomains consist of five stranded p sheets with two a helices on one side, while the 
thumb subdomain is composed of a bundle of four helices (Kohlstaedt et al., 1992; Jacobo 
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Molina et al., 1993). The fingers subdomain contains a mixed p sheet and three a helices. The 
RNase H domain consists of five p sheets flanked by four a helices (Fig. 4). The p66 and p51 
subunits have similar folds but p51 is more tightly packaged. Fingers, palm, thumb and 
connection subdomains fold similarly in both subunits, but their spatial organization 
changes due to the different positioning of the fingers, thumb and palm subdomains. Both 
HIV-1 RT subunits form a large cleft, where the thumb subdomain of p51 and the 
connection subdomains of p66 and p51 form the "floor", and fingers, palm and thumb 
subdomains of p66 provide lateral and apical interactions with the nucleic acid substrate 
(Jacobo-Molina et al., 1993). Active site residues in p66 are exposed to the cleft, but they are 
buried in the 51-kDa subunit (Rodgers et al., 1995). 



Fig. 4. Crystal structure of HIV-1 RT. (A) Ribbon representation of the structure of HIV-1 RT 
showing the fingers, palm and thumb subdomains of p66 in blue, red and green, respective¬ 
ly, the connection subdomain in yellow, the RNAse H domain in orange, and p51 in grey. 

(B) Ribbon representation of HIV-1 RT bound to a double stranded DNA. The template is 
shown in cyan and the primer in pale green. Catalytic Asp residues in the DNA polymerase 
domain are shown as yellow spheres and RNase H catalytic residues (Asp443, Glu478, 
Asp498 and Asp549) are represented as red spheres. (C) Individual structures of p66 (left) 
and p51 (right). Color codes are the same as in panels (A) and (B). Shown structures were 
all obtained with the PyMol molecular viewer (http://www.pymol.org) and Protein Data 
Bank coordinates 2HMI (Ding et al., 1998). 

The structure of the binary complex of HIV-1 RT and double-stranded DNA (Jacobo-Molina 
et al., 1993; Ding et al., 1998) showed that the nucleic acid binding cleft can accommodate 17 
nucleotides between the active sites of the DNA polymerase and the RNase H. Major 
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interactions between the RT and the template-primer are indicated in Table 1. The 
comparison of crystal structures of binary complexes with those obtained with unliganded 
RTs showed conformational changes involving the movement of the p66 thumb subdomain 
away from the fingers subdomain (Jacobo-Molina et ah, 1993, Rodgers et ah, 1995). In 
addition, the bound DNA adopts an A-type conformation in the vicinity of the DNA 
polymerase active site, but a B-like conformation near the RNase H domain. These changes 
in orientation involve a 40° bend of the DNA/DNA complex, near a-helix H in the thumb 
subdomain of p66 (Jacobo-Molina et al., 1993). 


Motif 

RT subdomain 

Residues involved 

Primer grip 

p66 palm (pi2-(?13) 

227-235 

Helix clamp 

p66 thumb (aH-al) 

255-268 (aH)/278-286 (al) 

RNase H primer grip 

p66 connection, 
p51 connection, 
p66 RNase H 

358-361 (p66 connection), 

395-396 (p51 connection), 
473-476, 501, 505 (p66 RNase H) 

Template grip 

p66 fingers, p66 palm 
(p4, aB, P5, p8-aE) 

73-77 (p4), 78-83 (aE), 

86-90 (P5), 141-174 (p8-aE) 


Table 1. HIV-1 RT residues and subdomains interacting with nucleic acid in the binary 
complex (Ding et al., 1998). 


The crystal structure of a binary complex of HIV-1 RT and an RNA/DNA hybrid revealed 
only small differences in comparison with the HIV-1 RT/double-stranded DNA complex 
(Sarafianos et al., 2001). Thus, the distance between the DNA polymerase and RNase H 
active sites is slightly larger (18 nucleotides) in the RNA/DNA complex, with most of the 
contacts between RT and template-primer being maintained. However, the RNA/DNA 
hybrid makes more contacts with the p66 submit of the RT at a-helix I (thumb subdomain), 
P-sheet 5 (palm subdomain) and with residues of the RNase H domain (Sarafianos et al., 
2001; Tuske et al., 2004). Also, a number of contacts between the p51 subunit and the RNA 
template were not detected in the structure having the DNA/DNA substrate (Sarafianos et 
al., 2001). Nucleic acids have similar A-like/B-like conformations in complexes containing 
DNA/DNA or RNA/DNA. However, the transition from the A- to the B-forms generates a 
wider minor groove in the RNA/DNA complex, which together with additional contacts 
between the RNase H primer grip and the RNA template seem to be determinant for the 
RNase H catalytic activity. 

An important milestone towards understanding the mechanisms and nucleotide specificity 
in DNA polymerization by retroviral RTs was the determination of the crystal structure of a 
ternary complex of HIV-1 RT bound to double-stranded DNA and an incoming dNTP 
(Huang et al., 1998). Nucleotide binding facilitates transition from an "open" conformation 
of the fingers subdomain in p66 (as observed in the structure of RT/DNA binary complexes) 
to a "closed" conformation where the p3-p4 hairpin loop in the fingers subdomain moves 
towards the p66 palm subdomain (see review by Sarafianos et al., 2009). This movement in 
the fingers subdomain is known to be the rate-limiting step in the polymerization reaction 
(Sarafianos et al., 2009), and brings amino acid residues Lys65 and Arg72 into close 
proximity with the incoming nucleotide (Tuske et al., 2004). Apart from these two residues, 
other important interactions in the nucleotide binding site are those established between the 
incoming dNTP and RT residues Aspll3-Alall4-Tyrll5-Phell6 and Glnl51, as well as with 
the two divalent cations (probably Mg 2+ ); and between RT residues Tyrl83 and Metl84 and 
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the DNA primer terminus (Huang et al., 1998). Binding of the incoming dNTP also produces 
a movement of the YMDD motif (including catalytic residues Aspl85 and Aspl86) that 
allows proper coordination of the catalytic aspartates with the metal cofactors, and triggers 
the nucleophilic attack of the 3'OH of the primer terminus on the a phosphorous of the 
incoming dNTP (Huang et al, 1998; Mendieta et ah, 2008). This polymerization event 
renders an elongated DNA primer and a pyrophosphate molecule that is released in the 
reaction. Structural data suggest that the YMDD motif acts as a "springboard" supplying 
some of the energy required for translocation (Sarafianos et al, 2002). 

3.2 HIV-2 RT 

HIV-2 RT shares around 60% sequence identity with HIV-1 RT. Despite conservation of the 
cleavage site at the N-terminus of the RNase H domain, the size of the small HIV-2 RT 
subunit is uncertain (for a review, see Herschhorn & Hizi, 2010). There is only one crystal 
structure of HIV-2 RT available, and this was obtained as an unliganded form using the 
complete p68 subunit (559 residues) bound to a smaller subunit that contained only 427 
residues due to degradation of the p68 polypeptide by bacterial proteases (Ren et al., 2002). 
The crystal structure of unliganded HIV-2 RT is similar to that of the HIV-1 RT, but the p68 
thumb subdomain is rotated by 8° relative to the unliganded HIV-1 RT p66 subunit. HIV-2 
RT heterodimers are also more stable than HIV-1 RT heterodimers (Divita et al., 1995). 

3.3 MLV RT 

Although MLV RT is quite different from HIV-1 and HIV-2 RTs (MLV RT is a monomer), 
the basic structure is the same: a right hand conformation with fingers, palm, thumb and 
connection subdomains and a C-terminal RNase H domain (Das & Georgiadis, 2004; Lim et 
al., 2006). Structural analysis revealed that HIV-1 and MLV RTs are rather different at their 
thumb and connection subdomains, while their fingers and palm subdomains show 
significant homology (for a review, see Cote & Roth, 2008). MLV RT is 111 amino acids 
longer than the p66 subunit of HIV-1 RT. Major differences are found at the N-terminus 
which is about 40 amino acids longer in MLV RT, and between the connection subdomain 
and the RNase H domain that contains 32 extra residues in the MLV RT. In the RNase H 
domain of this enzyme, there is a C-helix motif followed by a loop region of 11 amino acids, 
absent from HIV-1 RT. This structure is present in Escherichia coli RNase H. 

4. The copying fidelity of RTs 

The intrinsic fidelity of RTs has a major role in retroviral variability (for a recent review, see 
Menendez-Arias, 2009), although cellular polymerases ( e.g. replicative eukaryotic DNA 
polymerases a/p/5/e and RNA polymerase II) may also modulate the mutation rate in 
retroviruses. Other viral and host factors influencing the retroviral mutation rate include the 
cellular transcriptional machinery, physiological fluctuations of dNTP pools and 
asymmetric error repair. Several retroviruses (e.g. feline immunodeficiency virus, mouse 
mammary tumour virus, etc...) encode a dUTP pyrophosphatase (dUTPase) that prevents 
the incorporation of uracil into the viral genome. This protein is not present in HIV-1, but 
this virus contains a protein (Vpr) that allows for the encapsidation of isoforms of uracil 
DNA glycosylase. This cellular enzyme could contribute to reducing uracil content in the 
nascent viral DNA. In HIV, the virally encoded Vif protein overcomes the activity of 
APOBEC proteins, thereby contributing to maintain a non-lethal level of G->A mutations. 
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The intrinsic fidelity of purified retroviral RTs has been analyzed in vitro by using enzymatic 
(gel-based) or genetic assays (for a review, see Menendez-Arias, 2002). Gel-based assays are 
based on the determination of kinetic parameters for the incorporation of correct and 
incorrect nucleotides on specific template-primers, and provide an estimate of the 
nucleotide selectivity of the DNA polymerase. The relevant kinetic parameters are the k po i 
(nucleotide incorporation rate constant) and the Kd (equilibrium dissociation constant for 
dNTP) (Fig. 5). Their determination should be done under pre-steady-state conditions to 
avoid the contribution of the template-primer dissociation rate (Kellinger & Johnson, 2010). 
Since the fixation of a mutation involves nucleotide misincorporation followed by extension 
of the mismatched primer, similar assays should be carried out with template-primers 
having a mismatch at the 3'-end of the DNA primer, in order to have a better estimate of the 
intrinsic fidelity of RTs. These assays have demonstrated that HIV-1 RT, MLV RT and RTs 
of other retroviruses are one to three orders of magnitude faster in extending mispaired 
template-primers than the dissociation of the retroviral RT from DNA. Reported mispair 
extension efficiencies for HIV-1 and MLV RTs are usually within the range of Id 2 to 1CH, 
while misinsertion ratios range from Id 3 to Id 6 (Menendez-Arias, 2002; Matamoros et al., 
2008; Alvarez et al., 2009). 



Fig. 5. Nucleotide incorporation reaction and example of the determination of kinetic 
parameters k po i and Kd under pre-steady-state conditions. T/P n represents the template- 
primer with n being the nucleotide length, and T/P n+ i represents the same template-primer 
after the incorporation of one nucleotide. PPi stands for pyrophosphate. 


Genetic assays are based on the expression of reporter genes. The most popular assay is 
probably the one based on the use as substrate of a gapped doubled-stranded M13mp2 
DNA duplex that contains the lacZa gene (Bebenek & Kunkel, 1995). A gap-filling reaction 
is carried out in the presence of RT and dNTPs, and mutants are identified after 
transformation of appropriate bacteria by the white/blue color of M13 plaques revealed 
using X-Gal indicator plates. Nucleotide sequence analysis of the M13 DNA obtained from 
white and pale blue plaques allows for the identification of base substitutions, insertions, 
deletions, etc... These assays have shown that the HIV-1 RT is 10 to 20 times less faithful 
than the MLV and avian myeloblastosis virus RTs (Roberts et al, 1989). Since the crystal 
structure of HIV-1 RT is known and this enzyme is an important target of antiretroviral 
therapy, there have been many studies on the effects of amino acid substitutions on the 
copying fidelity of this enzyme. A summary is presented in Table 2. Major structural 
determinants of the accuracy of DNA synthesis by the HIV-1 RT involve dNTP binding 
residues, amino acids that interact with the template-primer or the primer strand, minor 
groove binding track residues and amino acids located at the RNase H primer grip. 
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RT location 

Residue 

Amino acid “ 

Effect on fidelity 


substitution 

Enzymatic 

assays 

Genetic 

assays 

References 

Nucleotide binding 

Lys65 

K65R 

▲ 

A (8.1) 

1-3 

site residues 

K65A 

▲ 

ND 

1 


Arg72 

R72A 

▼/A 

A (1.6) 

4 



A114S 



5 


Alall4 







A114G 

« 


5 



Y115A 

T 

▼ (4-0) 

2,6-8 



Y115F/V 

T 

« 

7-10 


Tyrll5 

Y115L/I/N 

T 

ND 

8 



Y115M/H/G 

T 

ND 

8 



Y115C/S/W 

T 

ND 

7, 8,10 


ninl 

Q151M 

« 

A (1.2) 

11,12 


Vj1JLLJL^/_L 

Q151N 

A 

A (13.1) 

12-14 


pu Q i z;n 

F160Y 


ND 

15 


i neiou 

F160W 


ND 

15 



M184I 

▲ 

A (4.0) 

16-18 


A/Tof 1RA 

Ml 84 A 

T 

ND 

6,19 


iviet io a 

Ml 84 V 

A/T 


6,17, 20-22 



M184L 

▲ 

ND 

6,23 

Located in the RT palm 

Vall48 

V148I 

ND 

► 

00 

24 

subdomain, close to the 






dNTP binding site 

Tyrl83 

Y183F 

▲ 

T (1.6) 

6,23 

Residues that 

Trp24 

W24G 

▲ 

ND 

25 

interact with the 


F61A 

ND 

A (11.7) 

26 

template strand 

1 1I6D1 

F61G 

ss 

ND 

25 


Leu74 

L74V 

▲ 

A (1.7-3.5) 

2, 3, 6, 27 



V75A 

T 

▼ (1-4) 

28 


Val75 

V75F 

▲ 

A (1.8) 

28 



V75I 

▲ 

A (1.8-3.0) 

28,29 


Asp76 

D76V 

▲ 

A (8.8) 

2, 30, 31 


Arg78 

R78A 

ND 

A (8.9) 

2,30 



E89G 

▼/A 

A (1.4-2.0) 

2, 6, 20, 21 


Glu89 

E89K 

ND 

A (1.2) 

32 



E89S 

ND 

A (1.6) 

32 



E89V 

ND 

A (1.3) 

32 

Residues that 

Phe227 

F227A 

▲ 

ND 

33 

interact with the 

Trp229 

W229A 

▲ 

ND 

33 

primer strand 


M230L 

ND 

T 

34 

(P2- p3 hairpin) 

Met230 

M230I 

T 

T 

10,34 


Gly231 

G231A 

T 

ND 

35 


Tyr232 

Y232A 

T 

ND 

35 
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Effect on fidelity 


RT location 

Residue 

substitution 

Enzymatic 

Genetic 

References 




assays 

assays 


Minor groove 

Gln258 

Q258A 

ND 

T (2.0) 

36 

binding track 

Gly262 

G262A 

ND 

T (4.1) 

36,37 

residues 

Trp266 

W266A 

ND 

T (3.0) 

36,37 


Gln269 

Q269A 

ND 

T (2.6) 

36 

a-Helices H and I 

Asp256 

D256A 

ND 

T (1.2) 

36 

(additional thumb 

Lys259 

K259A 

ND 

T (1.5) 

36 

subdomain residues) 

Leu260 

L260A 

ND 

▼ (1-2) 

36 


Lys263 

K263A 

ND 

V (1.5) 

36 

Arg277 

R277A 


A (1.1) 

38 

Gln278 

Q278A 

« 

T (1.2) 

38 

Leu279 

L279A 

s 

A (1.1) 

38 

Cys280 

C280A 

ss 

T (1.9) 

38 

Lys281 

K281A 

« 

A (1.1) 

38 

Leu282 

L282A 


A (1.3) 

38 

Arg284 

R284A 

s 

▼ (1-1) 

38 

Gly285 

G285A 


« 

38 

Lys287 

K287A 


A (1.3) 

38 


Symbols: Y and A indicate a decrease or an increase in RT's accuracy, respectively, »indicates that the 
fidelity of the enzyme is not affected by the mutation. Numbers between parentheses represent the fold- 
change in mutant frequency as determined with the M13mp2 lacZa forward mutation assay. ND, not 
determined. References: (1) Garforth et al., 2010; (2) Mansky et al., 2003; (3) Shah et al., 2000; (4) Lewis et 
al., 1999; (5) Cases-Gonzalez & Menendez-Arias, 2005; (6) Jonckheere et al., 2000; (7) Martin-Hernandez 
et al., 1996; (8) Martin-Hernandez et al., 1997; (9) Boyer & Hughes, 2000; (10) Gutierrez-Rivas & 
Menendez-Arias, 2001; (11) Rezende et al., 1998a; (12) Weiss et al., 2002; (13) Weiss et al., 2000; (14) 
Jamburuthugoda et al., 2005; (15) Gutierrez-Rivas et al., 1999; (16) Rezende et al., 1998b; (17) Hsu et al., 
1997; (18) Oude Essink et al., 1997; (19) Pandey et al., 1996; (20) Hamburg et al., 1998; (21) Drosopoulos 
& Prasad, 1998; (22) Wainberg et al., 1996; (23) Bakhanashvili et al., 1996; (24) Weiss et al., 2004; (25) 
Agopian et al., 2007; (26) Fisher & Prasad, 2002; (27) Rubinek et al., 1997; (28) Matamoros et al., 2008; 

(29) Alvarez et al., 2009; (30) Kim et al., 1998; (31) Kim et al., 1999; (32) Hamburg et al., 2006; (33) 
Wisniewski et al., 1999; (34) Cases-Gonzalez & Menendez-Arias, 2004; (35) Wohrl et al., 1997; (36) Beard 
et al., 1994; (37) Bebenek et al., 1995; (38) Beard et al., 1996. 

Table 2. Effect of amino acid substitutions on the copying fidelity of HIV-1 RT. 

5. HIV-1 RT as a drug target 

Despite the contribution of other RTs to our general understanding of their biochemistry 
and role in retroviral replication, HIV-1 RT has been the most widely studied enzyme due to 
its relevance as a target for the development of antiretroviral drugs. Around half of the 
currently licensed drugs for treatment of HIV-1 infection are RT inhibitors. These drugs can 
be classified into nucleoside RT inhibitors (NRTIs) (Fig. 6) and non-nucleoside RT inhibitors 
(NNRTIs) (Fig. 7). 
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5.1 Inhibitors of the DNA polymerase activity of HIV-1 RT 

5.1.1 Nucleoside RT inhibitors (NRTIs) 

Currently, there are seven NRTIs approved for treatment of HIV-l-infected patients, as well 
as a nucleotide analogue inhibitor, tenofovir disoproxil fumarate, which is a precursor of an 
acyclic nucleoside phosphonate known as tenofovir (Fig. 6). Newly developed NRTIs are 
currently at different stages of drug development (for recent reviews, see Jochmans, 2008; 
Menendez-Arias, 2008; Cahn & Wainberg, 2010). 



Fig. 6. Chemical structures of approved NRTIs: p-D-(+)-3'-azido-3'-deoxythymidine 
(zidovudine, AZT), p-D-(+)-2',3'-didehydro-2',3'dideoxythymidine (stavudine, d4T), p-D- 
(+)-2',3'-dideoxyinosine (didanosine, ddl), p-D-(+)-2',3'-dideoxycytidine (zalcitabine, ddC), 
P-L-(-)-2',3'-dideoxy-3'-thiacytidine, (lamivudine, 3TC), (-)-(lS,4R)-4-[2-amino-6-(cyclo- 
propylamino)-9H-purin-9-yl]-2-cyclopentene-l-methanol (abacavir), p-L-(-)-2',3'-dideoxy-5- 
fluoro-3'-thiacytidine (emtricitabine, FTC), and R-9-(2-phosphonomethoxy-propyl)adenine 
disoproxil fumarate (tenofovir disoproxil fumarate). 

NRTIs are prodrugs that inside the cell will be phosphorylated by cellular kinases to their 
triphosphate forms. The resulting triphosphate derivatives are used as substrates by HIV-1 
RT and incorporated at the 3'-end of the elongating DNA primer. Once incorporated, NRTIs 
act as chain terminators, blocking further DNA synthesis since these inhibitors lack the 
catalytically essential 3'OH group in the ribose ring. The therapeutic efficacy of NRTIs is 
determined by the efficiency of the activation process by cellular kinases (for a review, see 
Bazzoli et al., 2010). The long-term use of those drugs has been associated with a number of 
clinically relevant toxicities such as hyperlactatemia and lactic acidosis, lipoatrophy, etc. 
(Nolan & Mallal, 2004; Hammond et al., 2010). Some of those adverse effects appear to be 
related to the low specificity of the NRTIs, which do not markedly discriminate between RTs 
from different origins (HIV-1, HIV-2, SIV, MLV, etc) or cellular DNA polymerases. Most of 
these toxic effects are related to the inhibition of mitochondrial DNA polymerase y 
(reviewed in Kohler & Lewis, 2007). 

5.1.2 Non-nucleoside RT inhibitors (NNRTIs) 

There are five NNRTIs that are currently approved for treating HIV-1 infections: nevirapine, 
delavirdine, efavirenz, etravirine and rilpivirine (Fig. 7), and others are currently in clinical 
trials (for a recent review, see Jochmans, 2008). Unlike NRTIs, these inhibitors do not need 
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intracellular metabolic activation to inhibit the viral replication. The interaction of these 
compounds with the RT induces conformational changes that impact the catalytic activities 
of the enzyme (Sluis-Cremer et al., 2004). 



NNRTIs inhibit RT through binding to a hydrophobic pocket, located in the palm 
subdomain of the p66 subunit, at a distance of around 10 A from the DNA polymerase 
catalytic site. The hydrophobic pocket is lined by aromatic residues Tyrl81, Tyrl88, Phe227, 
Trp229 and Tyr232, hydrophobic amino acids Pro59, LeulOO, Vall06, Vall79 Leu234, and 
Pro236, and hydrophilic residues LyslOl, Lysl03, Serl05, Aspl32 and Glu224, all of them in 
p66. In addition, two residues in p51 (Ilel35 and Glul38) contribute to NNRTI binding. In 
the crystal structures of unliganded RT, there is no NNRTI binding pocket, but this pocket is 
formed when the inhibitor binds to the RT (Das et al., 2005). These drugs act as non¬ 
competitive inhibitors, binding to an allosteric site and interfering with dNTP incorporation, 
probably by altering the conformation of residues in the vicinity of the active site (Xia et al., 
2007), rather than having a direct effect on phosphodiester bond formation (Spence et al., 
1995). 

5.2 Molecular mechanisms of HIV-1 RT resistance 

Currently, most HIV-infected patients are treated with so-called "highly active antiretroviral 
therapy" (HAART), which consists of combinations of three or more inhibitors usually 
acting on two or more targets ( e.g. RT DNA polymerase, NNRTI binding pocket, viral 
protease, etc.). One of the most successful combinations is made of three RT inhibitors: 
tenofovir, emtricitabine and efavirenz (see Figs. 6 and 7 for chemical structures). HAART 
can efficiently block viral replication, but modest levels of replication can lead to 
development of drug resistance. Although drug-resistant variants may have an overall rate 
of replication lower than the wild-type viruses in the absence of drugs; compensatory 
mutations may emerge to improve the viral replication capacity of these viruses (for 
reviews, see Menendez-Arias et al., 2003; Martinez-Picado & Martinez, 2008). 

5.2.1 Resistance to NRTIs 

Zidovudine (AZT) was the first drug approved by the U.S. Food and Drug Administration 
(FDA), in 1987. Resistance to AZT was soon detected in patients under monotherapy for 6 
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months or longer (Larder et al., 1989a,b). The first AZT resistance mutations identified were 
M41L, D67N, K70R, L210W, T215F/Y and K219Q (Larder and Kemp, 1989). After the 
incorporation of new drugs into anti-HIV therapies, such as zalcitabine (ddC), didanosine 
(ddl) or stavudine (d4T), many other resistance mutations were identified. Interestingly, 
AZT resistance mutations were also found in patients under d4T therapy. These mutations 
were called "thymidine analogue resistance mutations" (TAMs). TAMs reside primarily in 
the palm and fingers subdomain of the RT, relatively far away from dNTP binding site. 
However, several NRTIs resistance mutations affect residues located at the nucleotide 
binding site ( e.g. K65R, Q151M, M184V, etc.) or at residues that interact with the 5' single- 
stranded template overhang (e.g. L74V, V75I, V75T, etc) (Huang et al, 1998). 

HIV-1 becomes resistant to NRTIs by two different mechanisms: (i) by increasing 
discrimination against the triphosphate forms of the drug, or (ii) by increasing the RT's 
ability to remove the NRTIs from blocked DNA primers, through phosphorolysis mediated 
by ATP or pyrophosphate (PPi). These two mechanisms are not mutually exclusive. 

i. Resistance by NRTI discrimination during DNA polymerization 

Mutations that confer resistance through this mechanism reduce the incorporation of the 
NRTI-triphosphate while the RT retains the ability to incorporate natural dNTPs. In these 
cases, resistance is usually associated with a decrease of the catalytic efficiency (fc po i/Kd) for 
NRTI-triphosphate incorporation, through the loss of affinity for the inhibitor (increase of its 
Kd) and/or by a reduction in the NRTI incorporation rate (reduced fc po i). A number of single 
amino acid substitutions found in vivo were associated with this mechanism (Table 3). A 
representative example is M184V/I, which confers high-level resistance to lamivudine (3TC) 
and emtricitabine (FTC). Metl84 is within the highly conserved YXDD motif, and its 
substitution to Val or lie causes a steric hindrance between the side-chain of these residues 
and the sulfur atom present in the ribose ring of 3TC and FTC (Sarafianos et al., 1999; 
Kellinger & Johnson, 2010). 

Combination therapies can select for mutational patterns that confer multidrug resistance. 
Thus, NRTI-resistant viruses containing amino acid changes A62V, V75I, F77L, F116Y and 
Q151M were initially observed in virus isolated from patients receiving AZT and ddl 
(Shirasaka et al., 1995). These mutants exhibited high-level resistance to AZT, ddl and d4T, 
and low-level resistance to abacavir, 3TC, FTC and tenofovir in phenotypic assays. This 
multidrug resistance pattern is known as the "Q151M pathway". Glnl51 interacts with the 
ribose moiety and the nitrogen base of the incoming dNTP (Huang et al., 1998). Q151M and 
the accompanying mutations affect the hydrogen bonding network between the nucleotide 
deoxyribose and the RT, while increasing the relevance of the interaction between the 
enzyme and the 3'OH of the nucleotide (Deval et al., 2002). This allows the mutant RT to 
better discriminate between normal dNTPs (with 3'-OH) and triphosphate derivatives of 
NRTIs (without 3'-OH). 

ii. Excision-mediated resistance. 

The NRTI excision reaction is one of the most important pathways towards the acquisition 
of resistance to antiretroviral drugs. Early biochemical studies revealed that nucleotide 
discrimination was not impaired in AZT-resistant HIV-1 RTs (e.g. in the quadruple mutant 
D67N/K70R/T215F/K219Q) (Carroll et al., 1994; Krebs et al, 1997). However, in the 
presence of a PPi donor, AZT-resistant RTs were able to remove the inhibitor from the 3'- 
end of the primer, thereby allowing DNA synthesis to proceed through phosphorolysis 
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Mutation 

Subdomain 

Resistance 

Mechanism 

Comments 

Ref. 

K65R 

fingers 

ddA, 3TC, 
FTC, 

tenofovir, 
ddl, abacavir 

^pol 

Suppresses AZT 
resistance. 

1-6 



ddC 

kpo\-Kd 



K70E 

fingers 

Tenofovir 

^pol 

Mixed mechanism 
of resistance: 
discrimination and 
excision. 

6,7 

L74V 

fingers 

Abacavir, ddl 

^pol"^d 

Impairs viral 
replication. 
Antagonizes the 
effect of TAMs on 

AZT resistance. 

2,8,9 

V75I 

fingers 

acyclovir 

K d 

Accessory mutation 
of the Q151 
complex. 

Antagonizes the 
effect of TAMs on 
AZT resistance. 

10-12 

V75T 

fingers 

d4T 

K d 

Mixed mechanism 
of resistance: 
discrimination and 
excision. 

1,13, 

14 

Q151M 

palm 

Multidrug- 

resistance 

^pol 

Associated with 4 
mutations: A62V, 
V75I, F77L and 

F116Y. 

3,15- 

18 

M184V 

palm 

3TC, FTC, 
Abacavir 

kpoi-Kd 

It can suppress 

AZT resistance. 

2,3, 

19,20 


References: (1) Selmi et al., 2001; (2) Deval et al., 2004; (3) Deval et al., 2005; (4) Feng et al., 2006; (5) 

White et al., 2006; (6) Sluis-Cremer et al., 2007; (7) Kagan et al., 2007; (8) Miranda et al., 2005; (9) Frankel 
et al., 2005; (10) McMahon et al., 2008; (11) Matamoros et al., 2009; (12) Tchesnokov et al., 2009; (13) 
Petropoulos et al., 2000; (14) Lennerstrand et al., 2001; (15) Kaushik et al., 2000; (16) Deval et al., 2002; 

(17) Ray et al., 2002; (18) Frangeul et al., 2008; (19) Sarafianos et al., 1999; (20) Kellinger & Johnson, 2010. 
For additional references, see Menendez-Arias, 2010. Acyclovir is a guanosine nucleoside analogue that 
inhibits herpes simplex virus replication. In patients co-infected with herpes simplex virus, it can inhibit 
HIV replication. Phosphorylated acyclovir inhibits HIV-1 RT, by terminating DNA chain elongation 
(Lisco et al., 2008). 

Table 3. NRTI resistance mutations which increase the discrimination capacity. 

mediated by ATP or PPi (Arion et al., 1998; Meyer et al., 1999; reviewed in Menendez-Arias, 
2008, 2010). The excision reaction involves a nucleophilic attack of the hydroxyl group 
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substituent at the y-phosphorous of ATP (the PPi donor) on the terminal phosphodiester 
bond of the blocked DNA primer (Matamoros et al., 2005). 

The efficiency of the excision reaction is influenced by several factors such as the nature of 
the template strand, the sequence context in which chain termination occurs, the Mg 2+ 
concentration, or the specific nucleoside analogue that blocks the 3' end of the primer. 
Thymidine analogues (AZT, d4T and ddT) and tenofovir are the best substrates of the 
reaction, while cytidine analogues (3TC and ddC) are removed very inefficiently (Mas et al., 
2002; White et al., 2004; Marchand et al., 2007; Sluis-Cremer et al., 2007). Another important 
parameter for the excision reaction is the positioning of the 3' end of the blocked primer. 
Newly incorporated NRTI, present at the primer site (P) (post-tranlocational complex) 
makes the terminal primer unavailable for excision. Nucleotide excision occurs if the 3' end 
of the primer is located in the nucleotide binding site (N) (pre-translocational complex) (Fig. 
8). The efficiency of this reaction also depends on the presence of a dNTP complementary to 
the next position on the template, that leads to the formation of a "dead-end complex" that 
blocks the excision reaction by forcing the equilibrium towards the post-translocational 
status (for a review, see Gotte, 2006). 



Fig. 8. Model showing the equilibrium between AZTTP incorporation and AZT excision 
from chain-terminated primers. The inhibition of DNA polymerization is favoured when 
the AZT moiety is at the P site (in green) and the next dNTP complementary is able to 
occupy the N site (in purple) to form a "dead-end" complex. Resistance occurs when the PPi 
donor (usually ATP) is able to excise the inhibitor when this is positioned at the N site. 

The recently solved crystal structures of complexes of wild-type and mutant 
M41L/D67N/K70R/T215Y/K219Q HIV-1 RTs bound to double-stranded DNA and the 
product of the excision reaction, AZT adenosine dinucleoside tetraphosphate (Ap 4 AZT, Fig. 
8) have revealed different orientations for the adenosine-monophosphate moiety of Ap 4 AZT 
in the wild-type and mutant RT complexes (Tu et al., 2010). The structures are consistent 
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with an important role of n-n stacking interactions between the adenine ring of Ap 4 AZT and 
the side-chain of Tyr215, while phosphate portions of the excision product seem to be 
stabilized by the side-chain of Arg70 in the mutant enzyme. Secondary TAMs such as D67N 
and K219Q would enhance the ability of the adenosine-monophosphate moiety to interact 
with Tyr215, while contributing to the proper alignment of the phosphates during the 
excision reaction. 

The emergence of HIV variants containing insertions or deletions in the fingers subdomain 
of the RT has been facilitated by the extensive use of NRTIs in the treatment of HIV-infected 
individuals (for review, see Menendez-Arias et ah, 2006). HIV-1 RT mutants with a 
dipeptide insertion (usually Ser-Ser, Ser-Gly or Ser-Ala) between codons 69 and 70 and 
additional TAMs such as M41L, A62V, T69S, K70R and T215Y showed very high levels of 
excision activity with primers terminated with AZT and d4T (Mas et al., 2000; Matamoros et 
al., 2004; Cases-Gonzalez et al, 2007). In addition, deletions of residues at positions 67 or 69 
of the RT have been associated with AZT resistance or hypersusceptibility (Imamichi et al., 
2000; Villena et al, 2007; Kisic et al, 2008). 

A number of mutations have been reported to reduce ATP-mediated excision activity in the 
presence of TAMs. Examples are K65R (White et al., 2006), L74V (Frankel et al., 2005; 
Miranda et al, 2005), V75I (Matamoros et al, 2009), Y181C (Selmi et al, 2003) or M184V 
(Gotte et al., 2000; for a review, see Menendez-Arias, 2008). Finally, mutations in the 
connection subdomain (for example, E312Q, G335C/D, N348I, A360I/V, V365I and A376S) 
or in the RNase H domain of RT (such a Q509L, H539N and D549N) can enhance resistance 
to AZT by altering the balance between excision and template RNA degradation (Nikolenko 
et al, 2005, 2007; Delviks-Frankenberry et al., 2007, 2008; Yap et al., 2007; Ehteshami et al., 
2008), while thumb subdomain polymorphisms (P272A/R277K/T286A) could also 
modulate excision by affecting the interaction of the RT with the RNA/DNA hybrid, 
without involvement of the RNase H activity of the RT (Betancor et al., 2010). 

5.2.2 Resistance to NNRTIs 

More than 40 NNRTIs resistance mutations have been identified in vitro or in vivo 
(Tambuyzer et al., 2009; reviewed in Clotet et al., 2009). Nearly all of these mutations occur 
at the NNRTI binding site and lead to the loss of NNRTI binding affinity by the HIV-1 RT. 
In many cases, the rigid structure of the inhibitors decreases their ability to accommodate 
themselves inside the hydrophobic pocket of the mutated RT. This could be the result of a 
loss of key hydrophobic interactions. For example, mutations involved in resistance to 
delavirdine or nevirapine ( e.g. Y181C, Y188L or F227L) affect interactions between aromatic 
residues in the RT and the NNRTIs. Rigid molecules such as nevirapine or efavirenz are 
prone to development of NNRTI resistance through single amino acid substitutions (e.g. 
K103N). Novel inhibitors (e.g. etravirine) have been designed as more flexible molecules 
that are more resilient to resistance since their binding to the RT is compromised only when 
two or more resistance mutations appear (for recent reviews, see Sluis-Cremer and 
Tachedjian, 2008; Sarafianos et al., 2009; de Bethune, 2010). Mutations outside the NNRTI 
binding site (e.g. N348I or A376S in the connection subdomain) have been recently 
associated with NNRTI resistance (Nikolenko et al., 2007; Yap et al., 2007; Schuckmann et 
al., 2010; Paredes et al., 2011). Their mechanism of action is indirect and relates to their 
effects on RT dimerization and interaction with the template-primer. 
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5.3 Inhibition of the RNase H activity of HIV-1 RT 

RNase H activity offers an additional target for the development of antiretroviral drugs due 
to its essential role in HIV-1 replication. Unfortunately, progress in the development of 
RNase H inhibitors has been relatively slow, due to problems of citotoxycity or limited cell 
uptake, as well as an interest in exploiting other viral targets. As of today, there are no drugs 
of this kind approved for clinical use. RNase H inhibitors that interfere with metal cofactor 
binding include N-hydroxymides, hydroxylated tropolones and diketo acids (for reviews, 
see Jochmans, 2008; Adamson & Freed, 2010; Beilhartz & Gotte, 2010). The interference with 
metal-ion coordination has been demonstrated by the structure of HIV-1 RT complexed with 
P-thujaplicinol (Himmel et al., 2009), a hydroxylated tropolone. Other RNase H inhibitors 
such as napthoquinones, vinologous ureas, small nucleic acid fragments (aptamers) and 
hydrazones show different mechanisms of action (Chung et al., 2010; Gong et al., 2011; 
reviewed in Jochmans, 2008; Beilhartz & Gotte, 2010). For example, the crystal structure of 
HIV-1 RT complexed with an N-acyl hydrazone analogue shows that this drug binds more 
than 50 A away from the RNase H domain, at a novel site between the polymerase active 
site and the NNRTI binding pocket (Himmel et al., 2006). This study suggests that these 
compounds may alter the trajectory of the template-primer inducing structural changes at 
the DNA polymerase primer grip and at the thumb subdomain, thereby preventing RNase 
H-catalyzed cleavage. 

6. Conclusions 

The discovery of reverse transcription forty years ago constituted a major breakthrough 
leading to the birth of modern retrovirology. With the identification of pathogenic human 
retroviruses and most notably HIV-1, the study of reverse transcription has devoted a lot of 
attention. These investigations have led to the development and clinical use of about a 
dozen compounds targeting HIV-1 RT that play a major role in the control of the disease. 
However, further studies should help us to find other targets in the reverse transcription 
process. Novel inhibitors targeting RNase H activity would be a valuable addition to the 
antiretroviral drug armamentarium, and experimental therapeutic approaches such as viral 
decay acceleration induced by mutagenic nucleosides could be promising if they prove to be 
efficient in vivo. In addition, there are important aspects related to retroviral replication that 
should be further studied. Namely, the contribution of viral and cellular proteins to the 
reverse transcription process; how reverse transcription is regulated inside the cell (or in 
virions); how cellular factors associate and control the reverse transcription complex after 
entering the cell and prior to proviral DNA synthesis; and also the dynamics of strand 
transfer that seem to be basic for understanding recombination in retroviruses. 
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1. Introduction 

The role of DNA replication is to successfully transfer all genetic information from parental to 
daughter cells. While this process requires faithful DNA duplication to maintain genome 
integrity, certain errors during DNA replication, coupled with selection and fitness, are critical 
for evolutionary events. Therefore, regulatory mechanisms are involved in achieving a certain 
degree of DNA replication fidelity and satisfying these diverse needs. It is reasonable to 
predict that microorganisms also replicate under tight regulations to accomplish the required 
DNA replication fidelity for selection and fitness. This notion also applies to herpes simplex 
virus type 1 (HSV-1), which replicates DNA with a genomic mutation rate similar to those of 
other DNA-based microbes examined (Drake and Hwang, 2005). Since the HSV-1 genome can 
be easily manipulated, it offers a good model for a variety of studies, including the study of 
DNA replication (reviewed in (Coen, 1996)). This review discusses the current knowledge of 
HSV-1 DNA replication fidelity with an emphasis on the role of DNA polymerase and other 
replicative proteins in regulating replication fidelity. 

2. Antiviral therapy and emergence of resistant mutants 

HSVs are ubiquitous human pathogens, causing significant public health issues. Although 
current antiviral drugs can effectively inhibit HSV replication, the increasing number of 
drug-resistance mutants is becoming a major problem in successful treatment of HSV 
diseases. Resistance to antiviral drugs arises from the development of mutations in viral 
genes that encode the antiviral target proteins. Antiviral drug treatment may provide an 
environment suitable for the emergence of resistant mutants, especially in 
immunocompromised patients, which confer the advantage to survive under antiviral drug 
pressure. Alternatively, spontaneous mutations may arise during drug treatment (reviewed 
in Kimberlin and Whitley, 1996). Although it can be debated that antiviral drugs may induce 
and/or enhance the selection and growth of drug-resistant mutants, it is generally believed 
that replication errors (spontaneous mutations) play a role in the development of mutations. 
Therefore, it is important to understand the regulatory mechanisms, including the 
replication errors and the subsequent repair processes, leading to the development of 
mutations during HSV DNA replication. This information can also be applied to understand 
the general mechanisms of how a cell controls and maintains certain degrees of DNA 
replication fidelity. 
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3. HSV-1 as a model for studying DNA replication fidelity 

HSV-l contains a linear double-stranded DNA genome with a size of 152 kbp, coding for 
approximately 80 open reading frames. HSV can be amended to delete or modify the genetic 
components, regardless of whether the gene is essential or non-essential for growth in 
cultured cells. This offers the advantage that the virus can be modified for a variety of 
biological, biochemical and genetic studies of each gene. For example, recombinant virus 
HP66, which is a HSV-1 polymerase catalytic subunit ( pol ) null mutant and contains a 
reporter gene, P-galactosidase, replaced parts of the pol sequences, had been constructed 
(Marcy, Yager, and Coen, 1990). Since Pol is essential for viral replication, such a 
recombinant virus can only be propagated in cells containing the integrated pol gene and 
expressing the Pol proteins, such as DP6 or Pol A5 cells (Hwang et al., 1997; Marcy, Yager, 
and Coen, 1990). Using this approach various pol mutant viruses, including the lethal 
mutants (Gibbs et al., 1991; Marcy, Yager, and Coen, 1990), can be constructed, propagated, 
and characterized. 

3.1 HSV-1 for the study of DNA replication fidelity 

While biochemical assays can measure the enzyme kinetics and the fidelity of the Pol in 
vitro, examination of the in vivo fidelity of DNA replication is more complicated than the in 
vitro enzymatic studies. Nonetheless molecular biological assays have been developed to 
examine DNA replication fidelity of HSV-1 in infected cells. The information obtained by 
the in vivo experiments together with kinetic data is invaluable for understanding the 
mechanisms regulating the fidelity of DNA replication. Three in vivo mutagenesis assays are 
discussed below. 

3.1.1 TK mutagenesis assay 

The TK (thymidine kinase) mutagenesis assay applies the principle that HSV-1 encoded TK 
enzyme is not essential for virus growth in proliferating cells in cultures and that it can 
specifically activate certain antiviral drugs, leading to the inhibition of viral growth. In 
another words, tk mutants that do not express the TK enzyme or express TK with altered 
substrate specificity can grow and form plaques in the presence of antiviral drug while those 
with functional TK that can activate the drugs fail to form plaques in plaque assay. Based on 
the unique property of the TK enzyme. Hall et al. (Hall et al., 1984) devised the TK 
mutagenesis assay to examine the mutation frequency of HSV-1. 

For example, HSV-1 TK can induce the phosphorylation of the antiviral drug acyclovir 
(ACV), a guanosine analog. Following phosphorylation by TK enzyme, the mono-phosphate 
form of ACV is converted to di- and tri-phosphate ACV by cellular enzymes, allowing its 
incorporation into newly synthesized DNA by HSV Pol. Since ACV lacks the 3'-OH group 
on the acyclic ribose structure, its incorporation into nascent DNA blocks further extension 
from the 3'-terminus and inhibits viral DNA replication. Thus, HSV TK plays an essential 
role in activation of the antiviral drug, and HSV TK mutants that fail to activate ACV will 
become resistance and form plaques on cultured cells in the presence of ACV. 

It is worth noting that the inhibitory effect of ACV triphosphate (ACV-TP) on viral Pol is 
mediated by a three-step mechanism (Reardon and Spector, 1989). In the first step, ACV-TP 
binds to Pol and acts as a competitive inhibitor of dGTP. The interaction between Pol and 
ACV-TP results in the incorporation of ACV monophosphate (ACV-MP) into the growing 
DNA chain, leading to chain termination. However, the inhibitory effect on Pol is dependent 
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on the presence of the next nucleotide complementary to the next template base, which 
freezes the ACV-MP on the primer/template to form the dead-end complex (Reardon and 
Spector, 1989). Examination of certain Pol mutants containing mutations within the 
conserved Pol domain further demonstrate that a Pol resistance to ACV-TP can be due to 
the altered enzyme kinetics of K,„ or K cat , or both. Further information is discussed in (Huang 
et al., 1999). Furthermore, HSV Pol exhibits a much greater affinity for ACV-TP than does 
cellular Pol, which offers the specificity of inhibiting viral DNA replication with the very 
low toxicity of normal host cells (Furman et al., 1979). 

Although tk mutants are easily identified in a simple plaque assay, this method may not be 
sensitive enough to detect all mutants, such as silent mutations and mutations that maintain 
enzyme activity. It is estimated that the TK mutagenesis assay can detect less than 50% of all 
possible substitutions of tk mutations (Hwang, Liu, and Hwang, 2002). Despite this, an 
extensive study (Lu, Hwang, and Hwang, 2002a) of tk mutants derived from wild-type 
HSV-1 strain KOS found that HSV replicates DNA with a genomic mutation rate similar to 
those of other DNA-based microbes (Drake and Hwang, 2005). However, earlier studies 
(Hall et al., 1984; Hall et al., 1985; Parris and Harrington, 1982; Hwang and Chen, 1995) 
demonstrated a roughly 10-fold higher mutation frequency than those (Hwang et al., 1997; 
Lu, Hwang, and Hwang, 2002a) used to calculate the mutation rate. Whether the selection 
methods (the use of antiviral drugs), experimental bias, or other unknown factors, such as 
the virus stocks and/or host cells used, could lead to the differences require further studies. 
Perhaps it is important to examine clinical isolates with limited passages in cell cultures, 
which may avoid the viral adaptation. Nevertheless, the study by Lu et al. (Lu, Hwang, and 
Hwang, 2002a) demonstrates that HSV-1 pol mutants can replicate the tk gene with different 
mutation spectra (see below). 

3.1.2 Lac Z mutagenesis assay 

The lacZ mutagenesis assay has also been applied to examine the mutation frequency of the 
lacZ gene when it is inserted into the viral genome (Hwang et al., 2003; Pyles and 
Thompson, 1994). This assay is based on observing the relative ratio of the number of 
white/light blue plaques over the total number of plaques examined by plaque assay after 
X-Gal staining. The advantage of this assay is its simple procedure of X-Gal staining of the 
plaques formed on cultured monolayers. However, identifying plaques with mutated lacZ 
genes may not be as simple as expected since bias could be derived from cells infected with 
a mixture of viruses containing wild-type and mutated lacZ genes; a similar bias also can be 
observed in the TK mutagenesis assay. This method may not be sensitive enough to detect 
all mutations, similar to the sensitivity issue of the TK mutagenesis assay. Therefore, we 
developed a more sensitive mutagenesis assay, the supF mutagenesis assay to study the 
replication fidelity of HSV. 

3.1.3 SupF mutagenesis assay 

The supF mutagenesis assay (Fig. 1) was first introduced by Seidman and colleagues 
(Seidman et al., 1985) to examine the mutagenic effect of carcinogens in mammalian cells 
using a shuttle plasmid containing SV40 DNA sequences for its replication in SV40- 
permissive cells and an E. coli amplicon for recovering DNA from E. coli. The shuttle 
plasmid contains the supF gene, which is a bacterial suppressor tRNA gene that can serves 
as a mutagenesis marker. The supF gene can suppress the amber codon present in the (3- 
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galactosidase ( lacZ ) gene, forming blue colonies in the presence of X-gal and IPTG in E. coli 
host that harbor the amber mutation in the lacZ gene. Mutated and inactivated supF genes, 
on the other hand, fail to suppress the amber mutation and are unable to express the (3- 
galactosidase enzyme, and thus, produce white colonies. Measuring the ratio of the number 
of white/light blue colonies over the total number of colonies reveals the mutation 
frequency. Furthermore, the short coding sequences (~90 bp) can be easily sequenced to 
identify the mutation. Other advantages of the supF mutagenesis assay include the low 
spontaneous mutation background and that more than 96% of possible changes in the supF 
gene have been demonstrated to become inactive and fail to express blue colonies. 
Therefore, it is a very sensitive assay (reviewed in (Canella and Seidman, 2000)). 

We have modified and inserted the supF amplicon into the tk locus, allowing measurement 
of the supF mutation frequency in the context of the viral genome during viral replication 
(Hwang and Hwang, 2003; Hwang, Liu, and Hwang, 2002; Hwang et al., 2004). 
Furthermore, the supF amplicon can be modified to contain the origin (oh) sequences 
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Fig. 1. SupF mutagenesis assay used in HSV-1 DNA replication fidelity. (A). Map of the 
plasmid DNA used for the construction of recombinant virus. (B). The action of supF tRNA 
in suppression of the amber codon in the lacZ gene in E. coli. In the absence of the supF 
tRNA only the truncated p-galatosidase proteins are translated due to the presence of an 
amber codon in the lacZ gene. In the presence of the supF tRNA the amber codon is 
suppressed by the insertion of a tyrosine residue and leads to the continuous synthesis of 
the full-length p-galatosidase protein. (C). Experimental procedures of the supF mutagenesis 
assay. The top line denotes the linear DNA of HSV-1. The relative locations of the thymidine 
kinase (tk) locus and origins of DNA replication (oriL and oriS) are shown. The components 
of an integrated supF amplicon within the tk locus are shown. The procedures of the 
mutagenesis assay are briefly illustrated. 
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required for HSV-1 DNA replication and applied as a shuttle plasmid for transient DNA 
replication in HSV-infected cells for the mutagenesis assay (Hwang et al., 1999), an assay 
similar to that applied by Seidman et al. (Seidman et al., 1985). The assay requires a minimal 
manipulation of DNA samples, which includes isolation and purification of total infected cell 
DNA, enzyme digestion, and ligation of the SMpF-containing amplicon. The swpF-containing 
amplicon can then be recovered in E. coli for the blue/white selection and the measurement of 
the mutation frequency, again represented as the ratio of the number of white/light blue 
colonies to total number of colonies recovered. Figure 1 illustrates the supF mutagenesis assays 
used for the study of HSV-1 replication fidelity. Studies of recombinant viruses derived from 
the HSV-1 KOS strain and the transient replication assay using the modified shuttle plasmid 
have revealed that the KOS virus replicates the supF genes at a mutation frequency (Hwang, 
Liu, and Hwang, 2002; Hwang et al., 1999), which is consistent with that observed for the supF 
amplicon propagated in mammalian cells (Seidman et al., 1985). 

4. Polymerase activity and DNA replication fidelity 
4.1 Multiple activities of HSV-1 polymerase 

DNA polymerase is the pivotal enzyme involved in DNA replication and fidelity. HSV-1 
DNA polymerase is composed of the catalytic subunit encoded by the polymerase gene (pol) 
and the accessory protein processivity factor (or UL42) encoded by the UL42 gene. The Pol 
subunit is a 1,235 amino acid polypeptide (Gibbs et al., 1988). It shares sequence homology 
with a-, 5-, and prokaryotic DNA Pols (Bemad et al., 1989; Brown, 2004) that contribute to 
the polymerization activity (the polymerase domain) and contains conserved motifs, namely 
Exo I, II, and III, common to other Pols possessing exonuclease activity (Bemad et al., 1989). 
Although a structural study reveals that HSV-1 Pol contains separated polymerase and 
exonuclease domains (Liu et al., 2006), the conserved 8-region C overlaps with the Exo III 
motif and the conserved region IV with the Exo II motif, suggesting the interdependence of 
DNA polymerase and its associated 3'-exonuclease proofreading activity (reviewed in 
(Coen, 1996)). In addition to these two functional components, HSV-1 Pol also contains a C- 
terminal domain that is required for Pol's interaction with the accessory protein UL42 
(Digard and Coen, 1990). Fig. 2 depicts the regions of Pol possessing these functional 
domains or activities. 


i 

t- 


Pol: IV 5 region C II VI III 


VII V 


Exo 


UL42 

Binding 


Fig. 2. Map of HSV-1 Pol polypeptide. The Pol protein is composed of 1,235 amino acids 
(a.a.). The labels above the line depict the relative locations of conserved regions I -VII and 
5-region C, which are conserved among the polymerase domains of other Pol proteins. The 
labels below the line indicate the relative locations of conserved exonuclease motifs (Exo I, 

II, and III) and the UL42 binding domain located at the C terminus. The overlapping regions 
between the conserved polymerase and exonuclease sequences argue for the structural and 
functional interdependence of two activities. 

Genetic and biochemical studies of mutations within the conserved regions of the 
polymerase domains confirm that these sequences are critical for the polymerase activity as 
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mutations within these conserved regions confer altered sensitivity to certain antiviral 
drugs, which are nucleoside or pyrophosphate analogs (Gibbs et ah, 1988; Hall et al., 1984; 
Hall et al., 1985). Regarding viral DNA replication fidelity, the effect of mutations in 
conserved regions of the polymerase domain, including those in both the polymerase and 
exonuclease domains, have been studied and summarized below. 

4.2 Polymerase mutation affects the replication fidelity 

The TK mutagenesis assay demonstrates that mutations in Pol can result in anti-mutator or 
mutator phenotypes (Hall et al, 1984; Hall et al, 1985). Such studies have provided the first 
evidence that HSV-1 Pol can regulate the replication fidelity of viral DNA and that TK 
mutagenesis is a useful means to measure the mutation frequency of HSV-1. Biochemical 
study has also demonstrated that mutant Pol can have the anti-mutator phenotype (PAA r 5) 
and that the improved fidelity can be attributed to the improved nucleotide selectivity (Hall 
et al., 1985). Sequencing has subsequently demonstrated that such a pol mutant contains a 
mutation within the conserved region of the polymerase domain (Gibbs et al, 1988). 
Examination of several tk mutants also demonstrates that the mutant Pol (PAA r 5) replicates 
the tk genes with different mutation types compared with those mediated by wild-type Pol 
(Hwang and Chen, 1995). PAA r 5 mutant Pol generates only frameshift changes, whereas the 
wild-type Pol generates tk mutants of both base substitutions and frameshift mutations at 
1:1 ratio. Therefore, HSV-1 Pol is critical for regulating replication fidelity. 

Enzymatic studies of wild-type and PAA r 5 Pols demonstrate that the mutant PAA r 5 Pol has 
higher K m of nucleoside triphosphates (dNTPs) relative to wild-type Pol (Hall et al., 1985; 
Huang et al., 1999), suggesting that the mutant Pol may have the better selectivity of 
incoming dNTPs. Study of the HSV-1 Pol model, based on the known structure of RB69 Pol, 
also suggests that arginine residue 842 of HSV-1 Pol is in close proximity to the sugar ring of 
dNTPs and that the mutation of arginine to serine (R842S; conserved region III; Fig. 2) in 
PAA r 5 Pol may weaken the interaction of Pol with dNTPs. 

The supF mutagenesis assay further demonstrates that the PAA r 5 mutant replicates DNA 
with an altered fidelity relative to the parental strain KOS virus. This mutant Pol induces 
differences in the type and distribution of base substitutions relative to those by wild-type 
Pol (Hwang, Liu, and Hwang, 2002), suggesting that the amino acid change in PAA r 5 Pol 
results in alterations in its replication fidelity. However, this mutant Pol has a modest 
mutator activity in replicating the supF gene (Hwang, Liu, and Hwang, 2002), which is in 
contrast to the anti-mutator activity observed by the TK mutagenesis assay (Hall et al., 1984; 
Hall et al., 1985; Hwang and Chen, 1995). Perhaps the sequence contents of the target genes 
contribute to the differences, similar to those observed for other Pols (reviewed in 
(Goodman et al., 1993)). Alternatively, the assay method could influence the outcomes of the 
studies (see below). 

The supF mutagenesis study also revealed that another Pol mutant (L774F) has an anti¬ 
mutator phenotype (Hwang et al., 2004). This mutant contains a mutation in conserved 
region VI of the polymerase domain and confers altered drug sensitivities, supporting that 
this conserved region plays a role in forming the structural and functional polymerase active 
site. Studies of mutations in this conserved region of other Pols, together with the structural 
information of RB69 and HSV-1 Pols (Wang et al., 1997; Liu et al., 2006), suggest that 
conserved region VI has a functional role in the HSV-1 Pol interaction with incoming 
dNTPs. In relation to the thumb, palm and finger subdomains of Pol, the L774F mutation, 
with a hydrophobic phenylalanine replacing the leucine residue, may induce a local 
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structure change of the finger subdomain and cause the finger subdomain to adapt a 
different orientation relative to other polymerase subdomains. The slight structural change 
of L774F may affect the relative position of the finger residues and subsequently lead to an 
altered catalytic reaction, binding affinity of dNTP and rate of polymerization, which may 
result in higher DNA replication fidelity (Hwang et al., 2004). Further enzymatic and 
structural studies of the mutant Pol will be necessary to demonstrate the proposed 
mechanism leading to the higher replication fidelity. 

5. Pol’s exonuclease activity and DNA replication fidelity 

The 3'-5' exonucleolytic proofreading activity plays a significant role in maintaining DNA 
replication fidelity. A Pol lacking the proofreading activity replicates DNA with a 10- to 100- 
fold less fidelity than that with such activity (reviewed in (Kunkel, 1988)). 

5.1 Mutator phenotype of HSV-1 Pol with defective exonuclease activity 

HSV-1 Pol has an intrinsic exonuclease activity that is mapped to the N-terminal half of the 
protein (Weisshart et al., 1994) and contains three conserved Exo motifs found in all Pols 
with proofreading activity. Mutations of the highly conserved residues within these Exo 
motifs result in the loss of exonuclease activity accompanied with the altered polymerase 
activity in vitro (Baker and Hall, 1998; Hall et al., 1995; Hwang et al., 1997; Kuhn and Knopf, 
1996). However, the degree and extent of the altered polymerase activity due to the 
mutations within the Exo motifs vary from study to study. For example, the Exo I mutant of 
D368A was found to retain wild-type levels of polymerase activity and to extend from a 
mismatched base at the 3'-end of the primer (Kuhn and Knopf, 1996; Song et al., 2004), while 
another study demonstrated the failure of extension from a mismatched base (Baker and 
Hall, 1998). It is possible that the differences in assay conditions and the primer-templates 
used could lead to such differences. Nevertheless, it is evidenced that a Pol with defective 
exonuclease activity is highly mutagenic (Baker and Hall, 1998). 

Studies of Exo III mutant Pols also demonstrate the roles of conserved residues Y577 and 
D581 on exonuclease activities. Mutants of Y577A, Y577H, D581A, and the double 
substitutions Y577H/D581A are defective in exonuclease activity and impaired in 
polymerase activity (Hwang et al., 1997; Kuhn and Knopf, 1996). The TK mutagenesis study 
demonstrates that Y577H and Y577H/D581A recombinant viruses are highly mutagenic 
relative to the parental wild-type virus (Hwang et al., 1997). On the other hand, the supF 
mutagenesis assay reveals the modest effects of the exonuclease activity on replication 
fidelity (Hwang, Liu, and Hwang, 2002). It is possible that different sequence content of the 
target genes could contribute to the difference of mutation frequency. Alternatively the 
assay method could play a key factor affecting the detection and measurement of mutations. 
For example, the TK mutagenesis assay can only identify the mutants among viable viruses, 
whereas the supF mutagenesis assay examines total DNA being replicated regardless of 
whether the mutations (in essential genes) are lethal. The higher magnitude of the mutation 
frequencies observed in the tk mutagenesis study may be theoretically explained by the fact 
that the tk mutation frequency could be amplified due to the increases of lethal mutations 
that reduces the total number of progeny viruses been analyzed on cell cultures. 
Nevertheless, the exonuclease-deficient Pol mutants replicate both target genes with 
significant difference of mutation spectra compared to those induced by the wild-type Pol 
(Hwang and Hwang, 2003; Lu, Hwang, and Hwang, 2002b). 
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5.2 Exonuclease deficient polymerase induces altered mutation spectra 

Examining tk mutants derived from wild-type strain KOS demonstrates that the majority of 
drug-resistant tk mutants are frameshift mutations (45 out of 66 mutants examined; 67%), 
containing insertion or deletion of one or two nucleotides in the regions containing 
homonucleotide runs. In contrast, the Y7 recombinant virus, which contains the Y577H 
mutation in the Exo III motif, replicates the tk gene with significantly less frameshift 
mutations (21%; p < 0.005) relative to wild-type virus (Lu, Hwang, and Hwang, 2002b). 
Furthermore, although the majority of substitutions are transitional changes in mutants 
derived from both wild-type and exonuclease-deficient viruses, they distribute differently 
(Lu, Hwang, and Hwang, 2002b). These findings imply that HSV Pol can incorporate 
mispaired nucleotides, and most of the misinsertions are corrected by the proofreading 
activity during DNA synthesis. Furthermore, the Y577H mutation not only loses 
exonuclease activity but also alters the polymerase activity as it replicates misinsertions at 
positions distinct from those of wild-type Pol. Only 3 identical base substitutions are found 
among 15 and 52 tk mutations replicated by wild-type and exonuclease negative Pol, 
respectively (Lu, Hwang, and Hwang, 2002b). Theoretically, one would expect to observe 
more identical substitutions in both groups if the exonuclease Pol does not have altered 
polymerase activity. The observation that recombinant Y7 (Y577H) virus also exhibits 
altered drug sensitivities relative to wild-type virus further supports that the Exo III motif 
has a role on the polymerization activity (Hwang et al., 1998) and that the polymerase and 
exonuclease domains of HSV-1 Pol are interdependent structurally and functionally, also as 
suggested by the overlap between conserved 8-region C and Exo III motif (Fig. 2). 

Not surprisingly, supF mutagenesis studies of the Exo III mutant demonstrate modest 
increases of the supF mutation frequency (Hwang and Hwang, 2003). This is sharply 
different from the several hundred fold increases of tk mutation frequency replicated by 
such mutant viruses (Hwang et al., 1997). Presumably the sequence content between two 
target genes and the assay methods, as discussed above, contribute to these differences. In 
agreement with these assumptions, the in vitro mutagenesis study of an Exo I mutant Pol 
using the lacZa reversion mutagenesis assay demonstrates increases of 3- to 18-fold in the 
reversion frequencies, depending on the dNTPs concentrations (Baker and Hall, 1998). 

5.3 Enzyme kinetics of exonuclease deficient polymerase 

Enzymatic studies of steady (and perhaps pre-steady) state kinetics of the Pol protein will 
provide information to illustrate the affinity of Pol on dNTPs within the context of a 
matched or mismatched 3'-terminus and the relative catalytic activity of Pol's incorporation 
of incoming dNTPs. Based on the kinetic data the extension efficiency and misinsertion 
efficiency of the Pol from a primer-template (p/t) can be obtained to indicate a possible 
mechanism of how a Pol discriminates between correct and incorrect nucleotide during the 
polymerization reaction. Although little information of enzyme kinetics is available. Baker 
and Hall (Baker and Hall, 1998) have demonstrated that both wild-type and the Exo I 
mutant Pol D368A have high affinity (low K,„) for correct nucleotide incorporation from a 
matched p/t, and these Pols exhibit extremely high K m (in the order of 3 magnitudes) for 
nucleotide incorporation from a mismatched p/t. Furthermore, the kinetic data demonstrate 
that exonuclease activity is required for removal of the misinserted nucleotides by slowing 
the primer extension (low V max for nucleotide incorporation at the mispaired 3'-termini) and 
reinsertion of the correct base to resume the efficient extension. Thus an enzyme lacking the 
ability to remove the mispaired base significantly reduces the extension efficiency from the 
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mispaired 3'-termini. Consistent with the results discussed above, we observed that both 
Y577H and Y577H/D581A mutant Pols had altered K m , V max , and extension efficiency from 
mispaired 3'-termini dependent on the mismatched 3'-base. Furthermore, both mutant 
Pols exhibited increases in infidelity (fine), determined as the ratio of 
(Km/V ma x)incorrect/(K m /Vmax)correct (unpublished data). Additional information of the kinetic 
approach addressing the mechanism affecting the fidelity of HSV-1 Pol can be found in a 
recent review (Zhu et ah, 2010). 

6. Processivity factor and DNA replication fidelity 

6.1 UL42 functions as a processivity factor 

The UL42 protein was originally identified as a non-specific double-stranded DNA 
(dsDNA) binding protein (Marsden et ah, 1987; Powell and Purifoy, 1976) and was defined 
as a processivity factor for Pol (Gottlieb et al., 1990). Unlike other known processivity 
factors, UL42 binds DNA directly. It also directly interacts with the Pol subunit to form the 
Pol/UL42 heterodimer, which binds more tightly to the p/t-DNA than to single-stranded 
DNA (ssDNA). On the other hand, Pol alone binds more tightly to ssDNA than to p/t-DNA 
(Weisshart, Chow, and Coen, 1999). Therefore, the Pol/UL42 heterodimer exhibits a ~2-fold 
faster association rate and a ~10-fold slower dissociation rate to p/t-DNA than does Pol 
alone. However, the increased affinity and stability of the Pol/UL42 complex to p/t-DNA 
does not alter the elongation rate relative to Pol alone (Weisshart, Chow, and Coen, 1999). 
The increased half-life, but not the elongation rate, of UL42/Pol on p/t-DNA compared 
with the Pol subunit alone could afford a greater ability for the excision of a 
misincorporated base by the exonuclease activity intrinsic to wild-type Pol (Chaudhuri, 
Song, and Parris, 2003). This suggests that, in addition to its processivity, UL42 may enhance 
the fidelity of wild-type Pol in viral DNA replication via the exonuclease activity. 

6.2 Processivity factor and replication fidelity 

Although the effect of UL42 on DNA replication fidelity at the nucleotide level has not been 
examined, studies from other systems demonstrate that the processivity factor can influence 
replication fidelity. For example, the T7 Pol processivity factor thioredoxin can increase the 
fidelity of single-nucleotide insertions by an exonuclease-deficient T7 Pol. It can also 
increase the frequencies of certain base substitutions and deletions (Kunkel, Patel., and 
Johnson, 1994). These findings imply that slippage may occur during enzyme dissociation or 
reassociation and that thioredoxin enhances the processivity of Pol by preventing it from 
dissociating from the p/t, thereby reducing the slippage-mediated insertion mutations. 
Paradoxically, the reduced accuracy of certain base substitutions synthesized by T7 Pol in 
the presence of thioredoxin also implies that it promotes the extension of mismatches and 
misalignments. Therefore, the processivity factor can act as an anti-mutator factor in the 
slippage-mediated insertion mutations and as a mutator factor in base substitutions and 
deletions. Recent studies also demonstrate that the accessory protein gp45 of bacteriophage 
BR69 Pol can modulate site-specific mutation rates in the target gene compared to those by 
the Pol alone, although the overall mutation rate increases only modestly (Bebenek et al., 
2002; Bebenek et al., 2005). The sliding clamp protein PCNA that enhances the processivity 
of replicative DNA Pol in eukaryotes can promote the bypass of DNA lesions (Mozzherin et 
al., 1997; O'Day, Burgers, and Taylor, 1992) and acts as a mutator factor. Studies have also 
demonstrated that mutations in genes encoding PCNA or subunits of replication factor C 
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(RFC) increase mutation frequency in vivo (Chen et al., 1999; Johnson et ah, 1996; McAlear et 
al, 1994; McAlear, Tuffo, and Holm, 1996; Umar et al, 1996). 

Several UL42 mutations that increase or decrease DNA binding without affecting its 
interaction with Pol have been constructed (Komazin-Meredith et al., 2008; Randell et al., 
2005). These studies suggest that UL42 interacts with DNA via interaction between the basic 
charged residues located on the surface of the molecule and the negatively charged DNA. 
Recombinant viruses expressing UL42 with increased DNA binding exhibits impaired 
phenotypes, including the formation of smaller plaques, decrease replication of progeny and 
synthesis of DNA with higher ratio of DNA copies per plaque forming unit (PFU) (Jiang et 
al, 2009). As expected the mutants are more mutagenic than the control virus expressing 
wild type UL42 (Jiang et al., 2009). Similarly mutants with decreased DNA binding also 
have defective phenotypes and are highly mutagenic (Jiang et al., 2007a), and that mutants 
with no detectable DNA binding are deleterious in viral growth and DNA synthesis (Jiang 
et al, 2007b). These studies provide information that UL42 cannot bind DNA too tightly or 
too weakly and that any perturbation of DNA binding can impact virus production and 
replication fidelity. Future studies of the effects of mutation on the enzyme kinetics and on 
the mutation spectra at the nucleotide level will be invaluable for understanding the 
mechanisms attributed to UL42 in regulating DNA replication fidelity. 

7. Nucleotide metabolism enzymes and DNA replication fidelity 

In addition to the TK enzyme, HSV-1 also encodes several genes expressing nucleotide 
metabolism enzymes, including deoxyuridine triphosphatease (dUTPase), uracil-N- 
glycosidase (UNG), and ribonucleotide redutase (RR). The dUTPase can convert dUTP to 
dUMP, which can subsequently lead to an increased dTTP pool and a decreased dUTP pool. 
The UNG enzyme can remove uracil bases, which are resulted from deamination of 
cytosines, on DNA. The activity of RR is to convert ribonucleoside diphosphates to the 
corresponding deoxyribonucleoside diphosphates. Theoretically, these enzymes may act to 
maintain an optimal condition favoring HSV-1 DNA replication. The study of Pyles and 
Thompson (Pyles and Thompson, 1994), however, demonstrates intriguing results of how 
viruses lacking the expression of these enzymes affect the replication fidelity in infected 
cells. Using recombinant viruses containing a deletion of tested genes and the lacZ reporter 
system the viral dUTPase is shown to have an antimutator activity, whereas the viral TK has 
the mutator activity. While the UNG- viruses replicated the lacZ gene with no significant 
difference of the mutation frequency relative to that of UNG + recombinants, continuous 
passage of UNG- viruses lead an increased mutation frequency, suggesting its antimutator 
activity. Furthermore, the interaction between UNG and Pol and the association of UNG 
with viral replisome also suggest a role UNG in regulating DNA replication fidelity (Bogani 
et al., 2010). There is no information whether HSV-1 encoded ribonucleotide redutase plays 
a role on replication fidelity. 

In contrast to the observed mutator activity of the TK enzyme (Pyles and Thompson, 1994), 
we demonstrated that the TK enzyme of strain KOS does not possess the mutator activity; 
recombinant viruses harboring either the lacZ or supF gene replicate both genes with equal 
fidelity in HSV-1 TK expressing or non-expressing cells (Hwang et al., 2003). Although it is 
not clear what the factor leads to the difference, one may wonder why the virus were 
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encoding a mutagenic gene. However, it is also important to note that each recombinant virus, 
including those used to establish the baseline relative mutation frequency, constructed by 
Pyles and Thompson (Pyles and Thompson, 1994) harbors the integrated lacZ gene at different 
location in the viral genome, which may affect the replication fidelity (see section 8.2). Further 
studies using recombinant viruses containing the reporter gene integrated at the defined 
location and with or without the expression of gene to be examined are necessary to 
demonstrate the effects of these nucleotide metabolism enzymes on replication fidelity. 

8. Other factors may affect the replication fidelity 

8.1 Single-stranded DNA binding protein and replication fidelity 

Little information is available regarding the contribution of other accessory proteins in DNA 
replication fidelity. However, a recent in vitro study of DNA replication fidelity using RB69 
Pol with and without accessory proteins, including single-stranded DNA binding protein 
(SSB), the processivity clamp, and the clamp-loading proteins, demonstrated that these 
proteins induced altered mutation rates in a site-specific manner, although the overall 
mutation rate of the target gene did not differ significantly relative to that by Pol alone 
(Bebenek et al., 2002; Bebenek et al., 2005). Thus, SSB protein and the other accessory 
proteins also play a role on replication fidelity. 

Our previous study examining the effect of an exonuclease-deficient Pol mutant, Y7(Y577H), 
on replication fidelity suggests the possibility that other DNA replicative proteins may have 
effects on regulating the replication fidelity (Hwang and Hwang, 2003). The mutagenic 
ability of the exonuclease-deficient Y7 Pol is expected to generate various mutations in the 
entire genome, including genes coding for Pol and other replicative proteins, and result in 
heterogeneous populations in viral stocks. Examining the pol gene sequences confirms the 
presence of heterogeneous pol sequences in a single viral stock. Marker rescue experiments 
in which the mutated pol is rescued to the wild-type sequences demonstrate that several 
rescued recombinants exhibit higher fidelity than those with wild-type Pol. These allow one 
to test the hypotheses that there are mutations in other replicative genes, which may 
contribute to the higher replication fidelity of the supF gene. It is also important to explore 
the mechanisms by which SSB may regulate replication fidelity in infected cells since this 
protein associates and colocalizes with cellular DNA repair and recombination proteins 
(Taylor and Knipe, 2004; Wilkinson and Weller, 2004) and the SSB protein is able to promote 
recombination in vitro (Nimonkar and Boehmer, 2002; Nimonkar and Boehmer, 2003; 
Nimonkar and Boehmer, 2004; Reuven et al., 2004). Along this line the contributions of other 
cellular replicative proteins in HSV DNA replication warrant for future studies. 

8.2 Can the position of target gene influence replication fidelity? 

The genomes of all alpha-herpesviruses, including HSV-1, are composed of unique long 
(UL) and short (US) sequences and repeated sequences that segment the unique regions. 
Study of proteins conserved among six alpha-herpesviruses reveals that US proteins are 
more divergent than those encoded by UL sequences (Brown, 2004). The study further 
indicates that proteins encoded by sequences at the repeat regions, UL1, UL56, US1, and 
US12, are highly divergent than those of other regions (Brown, 2004). This observation 
suggests that selective sequences within the viral genome have a comparatively rapid 
evolutionary rate or high mutation rate. It is possible that the virus has evolved to arrange 
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the sequences in clusters that can be the targets of different mutation rates. If this is the case, 
then one may speculate that HSV may have position-dependent DNA replication fidelity, 
which we have described as the position effect (Hwang et al., 2003). This may also explain 
that the proteins clustered in the middle of the UL regions (between UL27 and UL33) are 
highly conserved as they are essential for viral replication. However, two proteins required 
for viral DNA replication, UL42 and UL8, that encode a helicase/primase subunit are highly 
divergent. Perhaps, these essential proteins quickly evolved to have multiple functions to 
include differential activity critical for each virus's life cycle. 

8.3 Sequence context, replication mode and replication fidelity 

Consistent with other studies (reviewed in (Goodman et al, 1993), tk and supF mutagenesis 
assays also demonstrate that the sequence content affects HSV-1 replication fidelity. We 
demonstrate that the supF mutation spectra observed in the plasmid-borne replication assay 
differ from those replicated as a part of viral genome (Hwang, Liu, and Hwang, 2002; 
Hwang et al., 1999). The differences could be related to the mechanism of how DNA is 
replicated in infected cells. For example, viral genome replication involves the formation of 
head to tail arranged DNA concatemers and branched structure, presumably mediated by a 
recombination mechanism; the processing and cleavage of complex DNA molecules are 
required for DNA packaging and maturation of viral particles (Bataille and Epstein, 1997; 
Slobedman, Zhang, and Simmons, 1999). On the other hand, the on-based plasmid may not 
replicate by the same mechanisms. It is reasonable to hypothesize that the lack of viral repeat 
sequences in the oh-containing plasmid DNA may not form isomers similar to those observed 
in genomic DNA replication, and that the mechanisms involved in branch DNA formation 
may not be obvious. Further studies are required to determine whether the plasmid-borne 
DNA is replicated with a different mechanism relative to viral genomic DNA replication. 

9. Conclusions 

A large body of literature discusses DNA replication fidelity in both prokaryotic and 
eukaryotic cells, yet limited information is available regarding how HSV DNA replication is 
regulated to maintain a certain degree of fidelity. Understanding the mechanisms regulating 
HSV DNA replication fidelity may lead to comprehension of how drug resistant mutants 
develop. The proteins that regulate replication fidelity definitely play roles in HSV DNA 
replication. In the aspect of antiviral therapy, the knowledge gained from such studies may 
lead to the design of new strategies to combat HSV diseases. 
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1. Introduction 

Human cytomegalovirus (HCMV) is a member of the betaherpesvirus family. Like all 
herpesviruses, HCMV is an enveloped, double-stranded DNA virus. The genome of HCMV 
is 240,000 bp in size with at least 150 known open reading frames (ORFs) (Dunn, Chou et al. 
2003). A majority of the ORFs are nonessential for viral replication in cell culture. These 
nonessential ORFs likely encode proteins with redundant functions or proteins that may be 
required for replication in the human host. In addition, several ORFs are beneficial but not 
required for replication. However, approximately one-quarter, or 41 ORFs, are absolutely 
required for viral replication (Yu, Silva et al. 2003). The UL44 gene is essential. 

During productive infection, HCMV genes are expressed in a temporal cascade, designated 
immediate early (IE), delayed early, and late. The major IE genes (MIE) UL123/122 
(IE1/IE2) play a critical role in subsequent viral gene expression and the efficiency of viral 
replication (Meier and Stinski 1997; Meier and Pruessner 2000; Meier, Keller et al. 2002; 
Isomura and Stinski 2003; Isomura, Tsurumi et al. 2004; Isomura, Stinski et al. 2005). The 
IE72 protein, the predominant product of the IE1 transcript, is encoded by exons 2 and 3 
spliced to exon 4. The IE86 protein, the predominant product of the IE2 transcript, is 
encoded by exons 2 and 3 spliced to exon 5. Translation of the IE1 and IE2 transcripts begins 
in exon 2. The IE72 protein is not essential for viral replication at high MOI, but the IE86 
protein is essential (Marchini, Liu et al. 2001). The early viral genes encode proteins 
necessary for viral DNA replication (Pari and Anders 1993). Following viral DNA 
replication, delayed early and late viral genes are expressed which encode structural 
proteins for virion production. 

The UL 44 protein (pUL44), which binds double-stranded DNA, is an essential accessory 
protein for viral DNA replication and interacts specifically with the viral DNA polymerase 
encoded by UL54 (Pari, Kacica et al. 1993; Ripalti, Boccuni et al. 1995). pUL44 increases 
processivity of the polymerase along the viral DNA template (Ertl and Powell 1992; 
Weiland, Oien et al. 1994; Zuccola, Filman et al. 2000). The accumulation of the pUL44 is to 
strikingly high levels at late times after infection (Stinski 1978; Geballe, Leach et al. 1986). Its 
late kinetics of transcription and the high level of expression suggest an additional 
important role for viral replication. pUL44 is phosphorylated by the viral UL97 protein 
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kinase (pUL97) in infected cells (Krosky, Baek et al. 2003). Phosphorylation by pUL97 is not 
required for pUL44 to interact with the catalytic subunit of the viral DNA polymerase (Ertl 
and Powell 1992; Weiland, Oien et al. 1994). 

The HCMV UL44 transcription emit initiates at three distinct sites, which are separated by 
approximately 50 nucleotides and are differentially regulated during productive infection. 
Two of these start sites, the distal and the proximal, are active at early times, whereas the 
middle start site is inactive until late times (Leach and Mocarski 1989). Expression from the 
late start site is dependent upon viral DNA replication. 

We investigated whether the late start site is necessary for efficient viral DNA replication in 
human fibroblast cells. 

2. Distinct regulation of HCMV DNA polymerase prosessivity factor 

Transcriptional regulation of UL44 gene expression occurs at two levels. The two 
transcription start sites that are activated early in infection presumably respond to IE 
proteins. Each contains a conventional TATA element, which is the only region of significant 
homology between start sites 1 and 3. Late in infection, sequences that are dependent on 
DNA replication result in transcriptional activation at start site 2 (Leach and Mocarski 1989). 
This is referred to as a late viral promoter. 

2.1.1 Effect of the TATA elements on UL44 transcription 

To determine the role of the late UL44 promoter, we constructed recombinant viruses with 
the distal-early or the middle-late UL44 TATA elements mutated (dlTATAl, or dlTATA2, 
respectively) (Fig. 1). 



Fig. 1. Structure of recombinant HCMV BAC DNAs. Recombinant viruses with TATA 
element mutations. UL44 transcription initiates at three distinct sites (labeled as start site 1, 
2, and 3), which are separated by approximately 50 nucleotides. Two of these start sites, the 
distal and proximal, were active at early times after infection, whereas the middle start site 
was inactive until late times. The distal or middle TATA element was replaced by KanR 
with FRT sequence flanking. KanR was excised by FLP-mediated recombination. 
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To detect all the transcripts derived from the different start sites, RNase protection assay 
was employed. Three major transcripts initiating at the spatially distinct start sites were 
detected 2 and 3 days after infection with wild type virus (wt) (Fig. 2). Consistent with a 
previous report (Leach and Mocarski 1989), the middle transcript was not detected with wt 
and dlTATAl in infected cells maintained in the presence of phosphonoacetic acid (PAA), 
which is an inhibitor of HCMV DNA synthesis for 48 h (Fig. 2, lanes: 12 and 13), indicating 
that the TATA2 element was activated upon viral DNA replication. Substitution of the 
TATA2 element caused loss of the late transcript initiating at start site 2. (Fig. 2, lanes: 6 to 
11). The levels of the middle and the proximal transcripts derived from dlTATAl were 
slightly lower than those of wt at 2 d.p.i. (Fig. 2, lanes: 6 and 7). Since levels of the proximal 
transcripts were similar between wt and dlTATAl in the presence of PAA for 48 h. (Fig. 2, 
lanes: 12 and 13), substitution of the TATA1 element affects the transcription initiating at 

d.p.i. 


-^-1 (distal) 


^=2 (middle) 


-^-3 (proximal) 


Fig. 2. Effect of substitution for each TATA element on the UL44 transcription in cells 
infected with wt and recombinant viruses. Cytoplasmic RNAs were harvested at 1, 2, and 3 
days after infection with an MOI of approximately 1. Twenty micrograms of RNA was 
hybridized to 32 P-labeled antisense UL44 promoter probe at 37 °C overnight before 
digestion with RNase Tl. Antisense UL44 probe contains sequence upstream of the 
transcription start site of all the UL44 transcripts. The protected RNA fragments were 
subjected to electrophoresis in denaturing 6% polyacrylamide gels. Lanes: 1, Probe lacked 
RNase Tl; 2, mock; 3, 6, 9, and 12 wt; 4, 7,10, and 13 dlTATAl; 5, 8,11, and 14 dlTATA2; 3 to 
5,1 d.p.i.; 6 to 8 2 d.p.i.; 9 to 11, and 12 to 14, 3 d.p.i.; 12 to 14 in the presence of 
phosphonoacetic acid (PAA). Arrowl, 2, or 3 indicates the transcript initiating at start site 1, 
2, or 3, respectively. 


1 2 3 2 PAA (+) 


■Q 



1 2 3 4 5 6 7 8 9 10 11 12 13 14 
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start site 2 and 3 in a viral DNA replication dependent manner. Since pUL44 is an essential 
viral protein that acts as a processivity factor for the catalytic subunit of the viral DNA 
polymerase (pUL54) (Ertl and Powell 1992; Pari, Kacica et al. 1993; Weiland, Oien et al. 1994; 
Ripalti, Boccuni et al. 1995; Zuccola, Filman et al. 2000), loss of transcription start site 1 was 
responsible for the lower level of viral DNA replication of recombinant virus dlTATAl. 
Taken together, mutation of the distal or middle TATA element reduced the transcription 
from the corresponding start sites. 

2.1.2 The middle element had no effect on viral DNA synthesis 

We compared viral DNA replication of the recombinant viruses with wt at high or low MOI. 
While the relative input of viral genomes for an MOI equivalent to approximately 1 differed 
between 0.001 and 0.0007 for wt, dlTATAl, and dlTATA2 (Fig. 3a), viral DNA of dlTATAl 
was 5 to 10 times lower compared to wt after infection at an MOI of approximately 1 or 0.01 
(Fig. 3a and b, respectively) and dlTATA2 was similar to wt in relative amount (Fig. 3a and 
b, respectively). These results indicate that the reduction of late gene expression in the cells 
infected with dlTATA2 was not due to reduced DNA template for transcription. 

(a) (b) 



Fig. 3. Analysis of viral DNA replication after high or low MOI infection with wt and 
recombinant viruses. HFF cells were infected with wt or recombinant viruses at an MOI of 
approximately 1 or 0.01 and analyzed for viral DNA replication. Viral DNA was quantified 
by real-time PCR with gB primers and probe as described in the Materials and Methods. 
Real-time PCR with 18S primers and probe were also performed to serve as an internal 
control. Data are averages of three independent experiments, (a) HFF cells were infected 
with wt, dlTATAl, or dlTATA2 at an MOI of approximately 1 and harvested at 2, 3, and 4 
d.p.i. Each value, relative to the level of the wt DNA at 4 d.p.i., was calculated and plotted, 
(b) HFF cells were infected with wt, dlTATAl, or dlTATA2 at an MOI of approximately 0.01 
and harvested at 5, and 8 d.p.i. Each value, relative to the level of the wt DNA at 8 d.p.i., 
was calculated and plotted. 
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2.1.3 The distal and middle TATA element had an impact on the viral delayed early 
and late gene transcription 

To compare the overall viral delayed early and late transcription of the recombinant viruses 
with wt, cells were assayed for immediate early, early, delayed early and late gene 
transcripts by Northern blotting at 1, 2, and 3 d.p.i. RNA from infected cells maintained in 
the presence of PAA for 2 d was also assayed. RNA analysis after PAA treatment for 48 h 
did not show a significant difference of IE1 RNAs between wt, dlTATAl, and dlTATA2 (Fig. 
4a, lanes: 7 to 9, and Fig. 4e). In contrast, the level of IE RNAs from dlTATAl was 
approximately two-fold lower than that of wt and dlTATA2 2 and 3 d. p.i (Fig. 4a. lanes: 5 
and 6, and 10 and 11, and Fig. 4e). This might suggest that viral DNA replication modulated 
the IE transcription from the MIE promoter. In the absence of PAA, the amount of UL44 
RNA from dlTATAl was more than two-fold lower than that from wt at 1, 2, and 3 d.p.i. 
(Fig. 4b, lanes: 1 and 2, 4 and 5, and 10 and 11, and Fig. 5e). There was no significant 
difference in the amount of UL44 RNA between wt and dlTATA2 at 1 and 2 d. p.i. (Fig. 4, 
lanes: 1 and 3, and 4 and 6, and Fig. 5e). In contrast, the level of UL44 transcript from 
dlTATA2 was approximately 2-fold lower than that of wt at 3 d p.i. (Fig. 4b, lanes: 10 and 12, 
and Fig. 5e). The reduction of the UL44 transcript in the cells infected with dlTATAl or 
dlTATA2 correlates with the loss of the corresponding early or late transcript initiating at 
start site 1 or 2, respectively (see figure 2). 

The UL99 ORF, which encodes a tegument protein of HCMV, is located in a complex region 
of HCMV with a series of 3'-coterminal transcripts (Wing and Huang 1995). All the 
transcripts utilize a common polyadenylation site downstream of ORF UL99 (Wing and 
Huang 1995). Thus, real-time PCR analysis using the UL99 primers and probe detects the 
total amount of the UL92 to 99 transcripts. The pp28 tegument protein is translated from 
two rnRNAs of 1.6 and 1.3 kb. The 1.3 kb rnRNA is of relatively low abundance (Adam, 
Jervey et al. 1995). Northern blot analysis using the UL99 probe showed that in the presence 
of PAA, the 1.6 kb rnRNA from the UL99 promoter was not detectable, thus UL99 is a true 
late gene (Fig. 4c, lanes: 7 to 9) as reported previously (Adam, Jervey et al. 1995). Consistent 
with the previous report (Wing and Huang 1995), analysis of PAA-treated RNA also 
showed that rnRNAs initiating upstream of each of the potential ORFs in this region 
contained late transcripts (Fig. 4c lanes: 7 to 9). We did not detect differences in the level of 
steady-state rnRNAs between wt and dlTATA2 at 1 and 2 d.p.i. but we did detect it at 3 
d.p.i. (Fig. 4c, lanes: 1 and 3, 4 and 6, and 10 and 12). From a quantitative analysis using real¬ 
time RT-PCR analysis, the amount of the UL92 to 99 transcripts for dlTATAl and dlTATA2 
was more than two-fold lower at 2 and 3 d.p.i, respectively (Fig. 4e). These results indicate 
that the early or late transcripts initiating at start site 1 or 2 of the UL44 gene facilitated 
delayed early and late transcription. 

To determine whether the product from the UL44 late transcript initiating at start site 2 
affected other late gene expression. Northern blot analysis using the UL75, which encodes a 
glycoprotein of HCMV, probe was also performed. Three rnRNAs were detected (Fig. 4d). 
Two transcripts, 4.5 and 5.5 kbp, corresponding to the UL76, and UL77 and UL78 genes, 
respectively, which utilize different transcription start sites as reported previously (Wang, 
Duh et al. 2004). An analysis of the transcripts of the recombinant virus with an UL76 ORF 
deletion determined the transcript from the UL76 promoter in the three rnRNAs. The 
direction of transcription of the UL75 ORF is opposite to that the UL76, 77, or 78 ORFs. 
Thus, real-time RT-PCR analysis using the UL75 primers and probes detects only the UL75 
transcript. The initiation site of UL75 transcript was determined by primer extension as 
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Fig. 4. Analysis of UL44 and the subsequent gene transcription after infection with wt and 
recombinant viruses. HFF cells were infected with an MOI (multiplicity of infection) of 
approximately 1 and cytoplasmic RNA was harvested at 1, 2, and 3 d.p.i. as described in 
Materials and Methods, (a-d) Northern blotting for IE1, UL44, UL99 (pp28), or UL75 (gH). 
28S and 18S ribosomal RNA served as controls for an equal amount of RNA loading. Lanes: 
1,4, 7, and 10, wt: 2, 5, 8, and 11, dlTATAl; 3, 6, 9, and 12, dlTATA2; 1 to 3,1 d.p.i.; 4 to 6, 2 
d.p.i..; 7 to 9, 2 d.p.i.. in the presence of PAA; 10 to 12, 3 d.p.i. (a) Analysis with IE1 probe; 

(b) Analysis with UL44 probe; (c) Analysis with UL99 (pp28) probe; (d) Analysis with UL75 
(gH) probe, (e) Real-time RT-PCR for analysis for IE1/2, UL44, UL92-99 and UL75 (gH) gene 
transcripts. 
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described previously (McWatters, Stenberg et al. 2002). RNA analysis from PAA-treated 
cells for 48 h suggested that the UL75 is a true late viral gene (Fig. 4d, lanes 7 to 9 and long 
exposure: data not shown). Since a small amount of the transcripts from the UL76, UL77 and 
UL78 promoters were detected in PAA-treated cells after long exposure (data not shown), 
UL76, 77, and 78 were delayed early viral genes. Similar to the result using the probe for 
UL99, the level of steady-state mRNAs from the UL76, and the UL77 and UL78 promoters in 
the cells infected with dlTATA2 was lower compared to wt at 3 d.p.i. (Fig. 4d, lanes 10 and 
12). Likewise, the transcript from the UL75 promoter was also approximately 2-fold lower 
for dlTATA2 at 3 d. p.i. (Fig. 4d, lanes: 10 and 12, and Fig. 4e). The steady-state mRNAs 
from UL75, UL76, and UL77 and UL78 promoters in the cells infected with dlTATAl were 
lower at 2, and 3 d.p.i. (Fig. 4d, lanes: 5 and 6, and 10 and 11, and Fig. 4e). Since DNA 
replication of recombinant virus dlTATA2 was similar to wt, reduction in viral gene 
expression was not due to reduced DNA template for transcription. Taken together, the 
UL44 gene product from the late transcript initiating at start sites 2 has an impact on viral 
delayed early and late gene transcription independently of viral DNA replication. 
pUL44 is detected at sites of HCMV DNA replication centers in the nucleus (Penfold and 
Mocarski 1997; Park, Kim et al. 2006). Viral replication centers also serve as foci for viral 
gene expression, presumably in part by concentrating templates for transcription with the 
proteins that carry out or regulate this process. The herpes simplex virus (HSV) single- 
stranded DNA-binding protein, ICP8 is also located at viral DNA replication centers in the 
nucleus and stimulates multiple late viral gene promoters (Gao and Knipe 1991). A recent 
report shows that ICP8 co-precipitates with chromatin remodeling factors (Taylor and Knipe 
2004). Whether or not pUL44 is also associated with ATP-dependent nucleosome 
remodeling proteins on the HCMV genome is not known, but it might be possible that 
pUL44 recruits activating chromatin remodeling factors to late viral promoters at late times 
after infection. 

2.1.4 The distal and middle TATA element reduced accumulation of late viral protein 

We determined whether the reduction of these viral gene transcriptions affected expression 
of the viral late proteins. Compared to wt, the levels of the late viral protein pp28 (pUL99) 
were 16.7 and 7.4 -fold lower for dlTATAl and dlTATA2, respectively (Fig. 5, lanes: 8 and 

9). 

2.1.5 Growth kinetics of the recombinant viruses 

Since accumulation of late viral protein for the recombinant viruses was reduced, we 
compared the growth of the recombinant viruses with wt at low or high MOI. HFF cells 
were infected with wt or recombinant virus at an MOI of approximately 0.01 or 1. Virus 
from infected cultures at 1,5, 8, 11 and 14 d.p.i. or 1, 4, 5, 7, and 9 d.p.i., respectively, were 
assayed as described in the Materials and Methods. Viral growth of dlTATAl or dlTATA2 
was slightly delayed compared to the wt at low and high MOI infection (Fig. 6a and b). We 
detected approximately a 5 to 10-fold difference in viral replication at low MOI infection 
between wt and dlTATA2 at 5 and 8 d.p.i. (Fig. 6a), while DNA replication of dlTATA2 was 
similar at the same time (see figure 3a and b). Likewise, at high MOI infection, the viral titer 
of dlTATA2 was approximately 10-fold lower than that of wt at 4 and 5 d.p.i.. (Fig. 6b), 
while viral DNA replication of dlTATA2 was similar to wt (see figure 3a and b). Therefore, 
insufficient expression of the late gene product of UL44 was linked to delayed viral growth. 
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Fig. 5. Analysis of viral protein expression after infection with wt and recombinant viruses. 
HFF cells were infected with the wt and the recombinant viruses at an MOI of 1 and 
analyzed for viral proteins. Immediate early pIE72 (UL123), pIE86 (UL122), early ppUL44, 
and late pp28 proteins were analyzed 1, 2, and 3 d p.i. Anti-pGAPDH (p36) antibody was 
used to show equal protein loading. Lanes: 1, 4, and 7, wt; Lanes: 2, 5, and 8, dlTATAl; 3, 6, 
and 9, dlTATA2. Lanes 1-3,1 d p.i.; 4-6, 2 d p.i.; 7-6, 3 d p.i. 



123456789 67 

Fig. 6. Analysis of viral DNA replication after high or low MOI infection with wt and 
recombinant viruses. HFF cells were infected with wt or recombinant viruses at an MOI of 
approximately 1 or 0.01 and analyzed for viral DNA replication. Viral DNA was quantified 
by real-time PCR with gB primers and probe as described in the Materials and Methods. 
Real-time PCR with 18S primers and probe were also performed to serve as an internal 
control. Data are averages of three independent experiments, (a) HFF cells were infected 
with wt, dlTATAl, or dlTATA2 at an MOI of approximately 1 and harvested at 2, 3, and 4 
d.p.i. Each value, relative to the level of the wt DNA at 4 d.p.i., was calculated and plotted, 
(b) HFF cells were infected with wt, dlTATAl, or dlTATA2 at an MOI of approximately 0.01 
and harvested at 5, and 8 d.p.i. Each value, relative to the level of the wt DNA at 8 d.p.i., 
was calculated and plotted. 
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2.2.1 Effect of the TATA sequence in the UL44 middle promoter on the kinetics of 
transcription and the start site selection 

What determines the kinetics of the UL44 late transcript start site is unclear. An 
important parameter governing transcription initiation is the relative concentration of the 
viral DNA template and the promoter sequence. As discussed previously (Leach and 
Mocarski 1989), The UL44 early promoters have a canonical TATA sequence of "TATAA". 
In contrast, the viral late or middle TATA element is a noncanonical sequence of 
"TATTATTA" (Fig. 1). To determine the significance of the noncanonical TATA sequence 
on UL44 late gene expression from the middle promoter, we constructed recombinant 
viruses with the UL44 middle TATA sequence of "TATTATTA" mutated to "TATataaA", 
to contain a canonical TATA sequence, "TATAA" and the revertant (wt-R) (Fig. 1). The 
lower case letters indicate the mutated bases. The proximal and distal transcripts were 
similar between wt-R and TATAmut at 1, 2, and 3 d.p.i. (Fig. 7). In contrast, an alternative 
transcript initiating upstream of start site 2 was detected with TATAmut in infected cells 
1, 2, and 3 d p.i. (Fig. 7, lanes 3, 5, and 9). The alternative transcript was not detected with 
wt-R in the presence of PAA at 48 h but it was detected with TATAmut (Fig. 7, compare 
lanes 6 and 7). The transcript initiating at start site 2 was also detected with TATAmut, 
but it was at very low levels at 2 and 3 d p.i. (Fig. 7, lanes 5 and 9). The sequence of the 
UL44 middle TATA nucleotides affects the kinetics and the TSS selection of the UL44 late 
transcript. 

2.2.2 The noncanonical TATA sequence in the UL44 middle promoter is required for 
the accumulation of late transcripts 

To confirm that the noncanonical TATA sequence is required for the accumulation of late 
transcripts, we made adenine and thymidine substitutions to generate recombinant viruses 
TATAmut2 and TATAmut3 (Fig. 8a). As shown in Fig. 3b, when the TATA sequence is 
mutated to contain a canonical TATA sequence "TATAA", the accumulation of late 
transcripts was decreased at 2 and 3 d p.i. (compare lanes 9 with 1 to 8). An alternative 
transcript was detected with TATAmut3 as well as the middle transcripts from the start site 
2 at 2 and 3 d p.i. (Fig. 8b, lanes 5 and 8). These transcripts were not detected in the presence 
of PAA at 2 d p.i. (Fig. 8b, lanes 4 and 6). Since TATAmut contains a repeat of thymine and 
adenine nucleotides in front of TAA in the UL44 middle promoter (see figure 1), the number 
of the TA repeats in front of TAA nucleotides may determine the strength of the UL44 
middle promoter with TATAmut3. While the level of the distal transcript was similar 
between TATAmut2 and TATAmut3, the transcripts derived from the UL44 middle 
promoter with mut2 was lower than those with mut3 at 2 and 3 d p.i. (Fig. 8b: lanes 3, 5, 7, 
and 8). The noncanonical TATA sequence in the UL44 middle promoter influences the 
accumulation of late transcripts. 

The transcriptional strategies of DNA viruses exhibit a number of common features. Prior to 
initiation of viral DNA synthesis, during IE and early phases, infected cells are devoted to 
the production of viral proteins necessary for viral DNA synthesis, efficient expression of 
viral genes, or the other regulatory functions. Transcription of the late genes requires viral 
DNA synthesis. However, the molecular coupling of replication to transcription of late 
genes remains unclear. A part of the newly replicated DNA could serve as templates for 
transcription. Therefore, one hypothesis is that the increased concentration of transcriptional 
templates is necessary for initiation of the late UL44 transcription. The relative weak binding 
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Fig. 7. Effect of the UL44 middle TATA nucleotides on the UL44 transcription in cells 
infected with wt-R and the recombinant virus. Cytoplasmic RNAs were harvested at 1, 2, 
and 3 days after infection with an MOI of approximately 3. Twenty micrograms of RNA was 
hybridized to 32 P-labeled antisense UL44 RNA probe at 37 °C overnight before digestion 
with RNase Tl. Antisense UL44 RNA probe contains sequence upstream of the transcription 
start site of all the UL44 transcripts. The protected RNA fragments were subjected to 
electrophoresis in denaturing 6% polyacrylamide gels. Lanes: 1, lacking RNase Tl; 2, 4, 6, 
and 8, wt-R; 3, 5, 7, and 9, TATAmut; 2 and 3,1 d.p.i.; 4 to 7, 2 d.p.i.; 8 and 9, 3 d.p.i.; 6 and 7, 
in the presence of phosphonoacetic acid (PAA). Arrowl, 2, or 3 indicates the transcript 
initiating at start site 1, 2, or 3, respectively. Allowhead indicates the alternative transcript 
due to the substitution for the UL44 middle TATA nucleotides. 
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affinity of TBP to the noncanonical TATA sequence at the middle promoter may explain a 
lack of transcription at early times after infection. However, the weak binding affinity of 
TBP to the middle promoter is not the only reason for the lack of early transcription because 
late transcripts were not detected with recombinant viruses TATAmut2 or TATAmut3, 
while an alternative transcript was significantly detected with TATAmut3. Late specific 
transcription from a noncanonical TATA sequence may be simply a concentration effect 
after viral DNA synthesis or reflect the presence of a viral transcription factor that 
specifically activates a late promoter. 

The main function of TATA box is to anchor the transcription preinitiation complex guiding 
RNA polymerase upstream of the transcription start site (TSS). Therefore, the spacing 
between TATA box and TSS is functionally important for efficient transcription (Ponjavic, 
Lenhard et al. 2006), but the underlying mechanisms that determine the start site selection 
are not understood. As previously shown (Caminci, Sandelin et al. 2006), the preferred 
canonical sequence for the initiation site is a pyrimidine-purine (PyPu) dinucleotide situated 
at position -1, +1 relative to TSS. When the UL44 middle TATA element was replaced by a 
canonical sequence, the distance between the TATA box and TSS was shorten from 32 or 37 
to 22 nucleotides. Further studies are required to determine the role of a noncanonical TATA 
sequence at the middle promoter of the UL44 transcription unit on the viral late gene 
transcription. 

3. Conclusion 

While pUL44 accumulates to strikingly high levels at late times after HCMV infection, viral 
DNA polymerase does not accumulate. The late accumulation of pUL44 depends on the late 
UL44 promoter, which is required for efficient HCMV delayed early and late gene 
expression. 

4. Materials and methods 

Cells and virus titration. Primary human foreskin fibroblast (HFF) cells were maintained in 
Eagle's minimal essential medium supplemented with 10% fetal calf serum (Sigma, St. 
Louis, Mo.), penicillin (100 U/ml), and streptomycin (100 pg/ml) at 37°C in 5% CO 2 as 
described previously (Stinski 1978). The virus titers of wild type (wt) HCMV Towne and 
recombinant viruses were determined by standard plaque assays on HFF cells as described 
previously (Meier and Stinski 1997). Viral DNA input was determined by infecting HFF cells 
in 35 or 60 mm plates in triplicate, and harvesting the cells at 4 h post infection (p.i.) in PCR 
lysis buffer (lOmM Tris-HCl, pH 8.0, ImM EDTA, 0.001% TritonX-100, and 0.001% SDS) 
containing 50 pg/ml proteinase K. After 55°C for 100 min, the proteinase K was inactivated 
at 95°C for 10 min. The relative amount of input viral DNA was estimated by real-time PCR 
using HCMV gB primers and probes as described previously (Isomura, Tsurumi et al. 2004). 
For analysis of virus growth kinetics, cells were infected at a multiplicity of 0.01 or 1 plaque¬ 
forming unit (PFU)/cell with the wt and the recombinant viruses. At various times after 
infection, whole cells and supernatant including free virus was collected from infected 
cultures and three freeze-thaw cycles were performed before titration. Virus titers were 
determined by the 50% tissue culture infectious dose (TCID 50 ) assay on HFF cells as 
described previously (Nevels, Brune et al. 2004; Nevels, Paulus et al. 2004; Wang and Shenk 
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2005) except GFP foci rather than CPE were counted. We used Reed-Muench method to 
calculate TCID 50 . Wt and the recombinant viruses contain the GFP gene substituted for the 
dispensable, 10-kb US1-US12 region (US, unique short). 

RNase protection (RPA) assay. For construction of the antisense UL44 probe, DNA 
fragment including 5' upstream of the transcription start site of all the UL44 transcript was 
amplified by PCR using primer pairs of UL44 F2 RPA primer (5'- 
CCGCTGGCTCGGCGCGGCTG-3') and UL44 inner primer as described above, and cloned 
into TA cloning vector pCRII (Invitrogen). The resulting clone, pUL44 pro-5 was linearized 
with EcoRV and used as a template for SP6 RNA polymerase. Synthesis by SP6 RNA 
polymerase on linear pUL44 pro-5 DNA produced a 32 P-labeled antisense RNA probe in 
agreement with the predicted size. Cytoplasmic RNAs from mock-infected or HCMV- 
infected HFFs were isolated at the indicated times after infection as described previously 
(Hermiston, Malone et al. 1987; Chang, Malone et al. 1989). DNA replication was inhibited 
with 200 pg/ml phosphonoacetic acid (PAA) (Sigma, St. Louis, MO.) added to the medium 
at the time of infection and maintained throughout infection. Twenty micrograms of RNA 
was hybridized to 32 P-labeled antisense UL44 promoter probe at 37 °C overnight before 
digestion with RNase T1 (100U) as described previously (Lashmit, Stinski et al. 1998; 
Isomura and Stinski 2003). The protected RNA fragments were subjected to electrophoresis 
in denaturing 6% polyacrylamide gels followed by autoradiography on Hyperfilm MP 
(Amersham). 

Northern blot analysis. Twenty micrograms of cytoplasmic RNA was subjected to 
electrophoresis in a 1% agarose gel containing 2.2 M formaldehyde and transferred to 
maximum strength Hybond N+ (Amersham). Northern blot analysis with IE1 probe was 
performed as described previously (14, 15). UL44, UL99 or UL75 DNA was amplified by 
PCR using the primer pairs of UL44 ORFF (5' -TGC AGG AC AT CT CGG ACCTGTCGG-3') 
and UL440RFR (5'-CCAGACGCTGCTCAATTGCGCCG-3'), UL99F (5'- 
TGTGAGTTCGGTACCACGCCCGG-3') and UL99R (5'- 

CGT CT AGGT CGT CCGT CT CCTGC-3'), or UL75F (5'- 

CTGCGAAAAAGATCGGTAGCTGGCC-3') and UL75R (5'- 

CGCTGGACCCTCACGCATTTCACCTA-3'), respectively. A radioactive probe was 
generated by labeling with 32 P-dCTP as described above. 

Real-time RT-PCR analysis. Reverse transcriptase (RT) (Roche Applied Science, 
Penzberg, Germany) was used according to the manufacturer's directions to generate first 
strand cDNA from 2 pg of RNA and 250 ng of oligo-dT primer (Roche) in a final volume 
of 40 pi. Samples were heat-inactivated at 80°C for 5 min. Amplifications were performed 
in a final volume of 10 pi containing PLATINUM Quantitative PCR SUPERMIX-UDG 
cocktail (Invitrogen). Each reaction mixture contained 1 pg of the first-strand cDNA, 5 
mM MgClz, and 500 nM each primer, 250 nM each probe. MIE primers and MIE reporter 
probe were designed as described previously (Meier, Keller et al. 2002). HCMV UL44, 
UL92-99, and UL75 forward and reverse primers and reporter probe were designed using 
Primer Express® (Applied Biosystems) as follows. UL44F: 5'- 

TTTTCTCACCGAGGAACCTTTC-3'; UL44R: 5'-CCGCTGTTCCCGACGTAAT-3'; UL44 
probe: 5'-6-FAM-AGCGTGGCGATCCCTTCGACAA-tetramethyl rhodamine (TAMRA)- 
3'; UL99-255F: 5'-CCACGACGGCTCCAAGAA-3'; UL99-319R: 5'- 

TCGGTTTCGGAGCCTTGTC-3'; UL99-275Tprobe: 5'-6-FAM- 

ACGCGGTGCGCTCGACGTT-TAMRA-3'; UL75-272F: 



174 


DNA Replication and Related Cellular Processes 


(a) 


TATAmut2 




a m»iy) 


TATATAA (1) TATaATTA (?) TAT AAA A {.'3) 


□ |UL'MORF| 


TATAmut3 


RPA probe 


distal 

ijearty) 

-a - 


proximal 
? {early) 


ig 


TATATAA (1) | ATTAfaA (2) TATA AAA (3) 


Q | Ul 44 OMt-1 


HPA probu 


(b) 



Fig. 8. Effect of a canonical TATA sequence in the UL44 middle promoter on the 
accumulation of late transcripts, (a) Shematic representation of the recombinant viruses 
substituted with a canonical TATA sequence, (b) RNAs were harvested at 1, 2, and 3 days 
after infection with an MOI of approximately 3. RNase protection assay was performed with 
32 P-labeled antisense UL44 RNA probe at 37 °C overnight before digestion with RNase Tl. 
The protected RNA fragments were subjected to electrophoresis in denaturing 6% 
polyacrylamide gels. Lanes: 1, 3, 4, and 7 mut2; 2, 5, 6, and 8 mut3; 9 and 10, wt-R; 11, mut; 
12, lacking RNase Tl; 1 and 2,1 d.p.i.; 3 to 6 and 9 to 11, 2 d.p.i.; 7, and 8, 3 d.p.i.; 4, 6, and 
10, in the presence of phosphonoacetic acid (PAA); Arrow 1, 2, or 3 indicates the transcript 
initiating at start site 1, 2, or 3, respectively. Allowhead indicates the alternative transcript 
due to the substitution for the UL44 middle TATA nucleotides. 
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5'-TCCATATGCCT CG AT GT CTTTT -3'; UL75-339R: 5'- 

GGT C AG AT CT ACCT GGTTC AG A A AC-3'; UL75-296Tprobe: 5'-6-FAM- 

TTGGGCAACCACCGCACTGAGG-TAMRA-3' (Nihon Gene Research Laboratories Inc., 
Sendai, Japan). Thermal cycling conditions were an initial 50°C for 2 min and 95°C for 2 
min, followed by 40 cycles of 95°C for 15 sec and 60°C for 1 min. Quantitation of relative 
RNA was accomplished according to a standard curve analysis as described previously 
(Meier, Keller et al. 2002). Real-time PCR with G6PD primers and probe as described 
previously (White, Clark et al. 2004) were also performed to serve as an internal control 
for input RNA. Each real-time RT-PCR assays was performed in triplicate and 
standardized to threshold cycle values for each viral RNA from HFF cells infected with 
Rwt crs at 4 d.p.i. 

Viral DNA replication assay. After infection with an MOI of 1 or 0.01, cells were collected 
at 2, 3, and 4 d.p.i. or 5 and 8 d.p.i.., respectively. Cells in 35 mm plates in triplicate were 
suspended in lysis buffer (50mM Tris-HCl, pH 8.0, lOmM EDTA, 1% SDS and 20pg/ml 
RNase A) containing 50 pg/ml proteinase K. The replicated viral DNA was quantitated by 
real-time PCR using HCMV gB primers and probes as described previously (Isomura, 
Tsurumi et al. 2004; Isomura, Stinski et al. 2005). Real-time PCR with 18S primers and 
probe purchased from Applied Biosystems (Foster City, CA.) were also performed to 
serve as an internal control for input DNA. Data are averages of three independent 
experiments. 
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1. Introduction 

Unlike circular genomes in which the conventional replication fork machinery can 
accomplish the copy of the complete molecule, the requirement of a DNA/RNA molecule to 
prime DNA synthesis imposes replication strategies to avoid the loss of genetic information 
contained at the very end of the lagging strand in linear chromosomes since DNA 
polymerases are unable to start de novo DNA synthesis. Thus, once the most terminal primer 
is removed, a short region of unreplicated single-stranded DNA (ssDNA) will remain at the 
end of the chromosome that would eventually lead to a continuous shortening of the 
daughter DNA molecule after successive rounds of DNA replication (the end-replication 
problem). Therefore, it is essential to guarantee replication of the chromosome ends, that 
otherwise would cause cell death. Organisms containing linear genomes have developed 
novel replication strategies to overcome such a problem by either yielding head-tail 
concatemers, most of them making use of terminal redundancies as phages T4, T7 and SPP1, 
or by the circularisation and further rolling circle replication of their chromosomes, as it 
occurs in phage X [reviewed in (Salas & de Vega, 2008)]. In higher eukaryotes telomerase 
extends directly the 3' end, producing an overhanged ssDNA end (Kornberg & Baker, 1992) 
that finally can invade homologous double-stranded telomeric tracts, enlarging and 
protecting chromosome ends (Verdun & Karlseder, 2007). 

Other organisms, as bacteriophages, animal viruses as adenovirus and human hepatitis B 
virus, mitochondrial plasmids, linear chromosomes and plasmids of Streptomyces (Salas, 
1999), as well as several virus infecting Archaea, as halovirus (Bamford et al., 2005; Bath et 
al., 2006), possess replication origins, constituted by inverted terminal repetitions (ITR) 
together with a terminal protein (TP), placed at both ends of their linear chromosomes 
(Salas, 1991). In these cases, the location of the two replication origins allows both strands to 
be replicated continuously, without requiring asymmetric complexes of the replicative DNA 
polymerase with other accessory proteins to control the different mechanics of continuous 
and discontinuous synthesis (Blanco et al., 1989). The TP provides the OH- group of a 
specific serine, threonine or tyrosine to prime initiation of DNA replication from the ends of 
the linear chromosome, circumventing the end replication problem, the TP remaining 
covalently linked to such 5'-termini of the genome (TP-DNA) (Salas, 1991,1999; Salas et al., 
1996). 
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2. The protein-primed replication mechanism 

The development of a soluble in vitro replication system with highly purified proteins and 
TP-DNA from bacteriophage 4>29 of Bacillus subtilis has allowed us to lay the foundations of 
the so-called protein-primed mechanism of DNA replication (Salas et al., 1995; Salas et al., 
1996). 

Figure 1 shows a summary of the protein-priming mechanism of <\>29 DNA replication 
(Salas, 1991). Initiation of replication starts by the formation of a TP/DNA polymerase 
heterodimer that recognises the TP-containing DNA ends which are the origins of 
replication. On the other hand, the formation of a nucleoprotein complex of the t|)29-encoded 
double-stranded (ds)DNA binding protein p6 (DBP) at the DNA ends has been proposed to 
facilitate opening of the latter (see below) and, in the presence of the initiating nucleotide, 
dATP, stimulates the formation of the covalent linkage between dAMP and the OH group of 
a specific serine residue in the TP, catalysed by the 4>29 DNA polymerase (hereafter ift.9 
DNApol). Afterwards, the same polymerase molecule starts the elongation step (DNA- 
primed) of replication. This results in the formation of the type I replication intermediates 
that consist in full-length dsDNA molecules with one or more ssDNA tails of different 
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Fig. 1. Schematic representation of bacteriophage 4>29 TP-DNA replication. Primer and 
parental TP are shown in black and green, respectively. 4>29 DNApol, DBP and SSB are 
coloured in blue, red and yellow, respectively. An scheme of the type-I and type-II 
replication intermediates is shown at the left. See text for details. 
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lengths that are stretched by the binding of the phage-encoded single-stranded DNA 
binding protein (SSB). When the two replication forks, that have been initiated at each DNA 
end, are encountered, type I molecules give rise to two type II molecules that consist of a 
full-length 4>29 DNA molecule in which a portion of the DNA, starting from one end, is 
double-stranded and the portion spanning to the other end is single-stranded. Type II 
molecules are elongated by the 4>29 DNApol with concomitant dissociation of the SSB 
protein to yield two fully replicated cj)29 TP-DNA molecules. The final dissociation of the 
DNA polymerase from each DNA molecule should allow the formation of new 
heterodimers with free TP molecules to initiate a new round of replication. 

These steps and the proteins involved are dissected in the following sections. 

2.1 Replication origins of TP containing genomes 

4>29 is a lytic phage that infects the gram-positive bacterium B. subtilis (Anderson et ah, 
1966). It has a linear dsDNA genome 19.3 kbp long, with a 6 bp ITR (3'-TTTCAT) (Escarmis 
& Salas, 1982; Yoshikawa & Ito, 1982) and a 31 kDa TP covalently linked to each 5' end by a 
phosphoester bond between Ser 232 and 5'-dAMP (the initiator nucleotide) (Salas et al., 1978), 
forming the minimal replication origin. 

Other B. subtilis phages related to tf>29 that also contain linear dsDNA and TP of similar size 
are classified in three groups: 1) ol5, PZA and PZE that belong to the 4>29 group; 2) Nf, M2 
and B103; and 3) GA-1. The DNA of all these phages has a short ITR six nucleotides long (3'- 
TTTCAT) for <|>29, ol5, PZA and B103, eight nucleotides long (3'-TTTCATTC) for Nf and M2, 
and seven nucleotides long (3'-lTT ATCT) for GA-1 (Salas, 1991). 

Phage Cp-1, that infects the Gram-positive bacterium Streptococcus pneumoniae, contains a 
19.3 kbp linear dsDNA (Martin et al., 1996b) with a TP of 28 kDa covalently linked to the 5' 
DNA ends by a phosphoester bond between a still undetermined Thr and 5'-dAMP (Garda 
et al, 1983; Garda et al., 1986). Cp-1 DNA has an ITR of 236 bp with the reiteration 3'-TTT 
(Martin et al., 1996a). 

TP-DNA genomes have also been observed in phages infecting Gram-negative bacteria as it 
is the case of phage PRD1, a member of a family of lipid-containing phages that infect 
Escherichia coli. The 5' termini of the 15 kbp long linear dsDNA of PRD1 (Bamford et al., 
1991) are linked to a 28 kDa TP by a phosphoester bond between Tyr 190 and 5'-dGMP 
(Bamford & Mindich, 1984; Shiue et al., 1991). The TP-DNA of PRD1 and related phages has 
110 bp long ITR and the reiteration 3'-CCCC at the ends. 

Adenoviruses also contain a linear dsDNA genome 36 kbp long with two replication origins 
located at the 100 bp long ITRs. The Adenovirus type-5 origin sequence starts with the 
reiteration 3'-GTAGTA. The 5' ends are covalently linked to the 55 kDa TP by a 
phosphoester bond between Ser 580 and 5'-dCMP [reviewed in (Coenjaerts & van der Vliet, 
1995; Van der Vliet, 1995)]. 

The product of 4>29 early gene 6 is a DBP that binds preferentially to the 4>29 DNA ends 
every 24 nucleotides and with a defined phase, being essential for the activation of the 
initiation of DNA replication (Serrano et al., 1990). This protein has been described as a 
histone-like protein that self-associates into elongated oligomers doughnut-shaped that 
grow into right-handed double-helical filaments (Abril et al., 1999). These filaments form a 
scaffold tightly wrapped by a DNA right-handed superhelix (Serrano et al., 1990) restraining 
its positive supercoiling (Prieto et al., 1988; Serrano et al., 1993b) and helping to open the 
DNA ends, activating initiation of TP-DNA replication most probably by allowing 
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replication origin recognition by the DNA polymerase/primer TP heterodimer (Serrano et 
al., 1993a). In support of this hypothesis, (]>29 DBP is absolutely required for the in vitro 
formation of the TP-dAMP complex at low temperature (Serrano et al., 1994). Our results 
were pioneers in proposing the structural basis for the activation of replication origins, a 
biological event shown later to be universally conserved in both prokaryotic and eukaryotic 
initiators (Clarey et al., 2006; Erzberger et al., 2006; O'Donnell & Jeruzalmi, 2006). 

By site-directed and deletion mutagenesis we showed that the N-terminal region of p6 is 
involved in DNA binding. Specifically, mutation in residues Lys 2 or Arg 6 resulted in p6 
proteins impaired in DNA binding (Freire et al., 1994; Otero et al., 1990). On the other hand, 
residues critical for the dimerization of protein p6, identified by random mutagenesis, are 
He 8 and Ala 44 . In addition to impaired dimer formation ability, mutations at these two 
residues showed reduced DNA binding affinity and they were affected in the initiation of 
4>29 DNA replication. Thus, dimers seem to be the active form of 4>29 DBP for DNA binding 
(Abril et al., 2000). 

2.2 Initiation of (|>29 TP-DNA replication 

The first step in 4>29 TP-DNA replication is the recognition of the replication origins by a 
heterodimer formed by two viral encoded proteins: the replicative DNA polymerase and a 
free molecule of TP. 

2.2.1 ({>29 DNApol 

4>29 DNApol is a monomeric enzyme of only 66 kDa, fully responsible for viral DNA 
replication (Blanco & Salas, 1985a). Based on amino acid sequence similarities and its 
sensitivity to specific inhibitors, we included c{>29 DNApol in the family B (eukaryotic-type) 
of DNA-dependent DNA polymerases (Bernad et al., 1987). As any of them, it accomplishes 
sequential template-directed addition of dNMP emits onto the 3'-OH group of a growing 
DNA chain, with insertion discrimination values ranging from 10 4 to 10 6 and with an 
efficiency of mismatch elongation 10 5 to 10 6 -fold lower than that of a properly paired primer 
terminus (Esteban et al., 1993). In addition, ({>29 DNApol catalyses 3'-5' exonucleolysis, i.e., 
the release of dNMP units from the 3' end of a DNA strand (Blanco & Salas, 1985b), 
showing a preferential degradation of a mismatched primer-terminus, in agreement with a 
role in proofreading of DNA insertion errors that enhances replication fidelity 10 2 -fold 
(Esteban et al., 1994; Garmendia et al., 1992), as it occurs in most DNA replicases. 

An extensive mutational analysis of individual residues contained in regions of high amino 
acid similarity among family B DNA polymerases, as well as the construction of deletion 
mutants (Blanco & Salas, 1996) allowed us to identify the ({>29 DNApol catalytic residues 
required for these activities and those responsible for the stabilisation of the primer- 
terminus at both active sites. As these residues are evolutionarily conserved the results 
obtained with ({>29 DNApol could be extrapolated to most DNA polymerases [reviewed in 
(Blanco & Salas, 1995, 1996)]. Thus, sequence alignments and site-directed mutagenesis 
served to identify the catalytic and ssDNA ligand residues responsible for the 3'-5' 
exonuclease activity, located at the N-terminal one-third of the enzyme (exonuclease 
domain), and to propose the hypothesis, widely demonstrated later, of an evolutionary 
conserved 3'-5' exonuclease active site among distantly related DNA-dependent DNA 
polymerases (Bernad et al., 1989). Such an active site is formed by three N-terminal amino 
acid motifs, named Exol, ExoII, and ExoIII, invariantly containing four carboxylate groups 
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that bind two metal ions and one tyrosine residue that orients the attacking water molecule 
(Bernad et ah, 1989), as it had been shown to occur in E. coli Pol I (Derbyshire et al., 1991). In 
addition, these analyses led us to identify a new motif (Kxih) whose lysine residue plays an 
auxiliary role in catalysis, specifically in family B DNA polymerases (de Vega et al., 1997). 
Similarly, our site-directed mutagenesis studies of 4>29 DNApol pioneered the functional 
analyses of specific amino acids at motifs YxGG, DX 2 SLYP, KxsNSxYG, TX 2 GR, YxDTDS, 
and KxY, highly conserved at the C-terminal two-thirds of eukaryotic DNA polymerases 
from family B. These investigations demonstrated the overlapping between polymerisation 
and protein-primed initiation domains, and served to identify the amino acids involved in 
metal binding and catalysis, as well as DNA, TP and dNTP ligands (Blanco & Salas, 1995, 
1996; Perez-Amaiz et al., 2010). 

We have shown that 4>29 DNApol has three distinctive functional features compared to most 
replicases. First, it initiates DNA replication at the origins located at both ends of the linear 
genome by catalysing the addition of the initial dAMP onto the hydroxyl group of Ser 232 of the 
phage TP, which acts as primer (see below) [reviewed in (Salas 1991,1999; Salas & de Vega, 
2006)]. Second, unlike most replicases that rely on accessory proteins to be stably bound to the 
DNA, as thioredoxin in the case of T7 DNA polymerase (Huber et al., 1987; Tabor et al., 1987), 
the (3-subunit of E. coli Pol III holoenzyme (Kong et al., 1992), or the eukaryotic clamp protein, 
PCNA (Jonsson & Hiibscher, 1997; Kelman, 1997), 4>29 DNApol performs DNA synthesis 
without the assistance of processivity factors, displaying the highest processivity described for 
a DNA polymerase [>70 kb; (Blanco et al., 1989)]. A third distinctive property of §29 DNApol 
is the efficient coupling of processive DNA polymerisation to strand displacement. This 
capacity enables the enzyme to replicate the double-stranded genome without the need for a 
helicase (Blanco et al., 1989). These two features, high processivity and intrinsic strand 
displacement activity, are currently being exploited for the use of 4>29 DNApol in isothermal 
multiple displacement amplification (MDA) (Dean et al., 2002; Dean et al., 2001). These 
amplification technologies based on (|>29 DNApol have two main advantages respect to 
classical PCR DNA amplification: first, no previous sequence information is required, due to 
the use of random hexamer primers, any DNA being susceptible to be amplified, and second, 
amplicons performed by the 4>29 DNApol are much larger than those obtained with PCR. On 
the other hand, the ability displayed by 4>29 DNApol to use circular multiply primed ssDNA 
as template has led to the development of the multiply primed rolling circle amplification, one 
of the most robust technologies to amplify circular templates of variable size (Dean et al., 
2001). This amplification technology is being widely used for genome sequencing, efficient 
amplification and detection of known and unknown circular viral genomes (Johne et al., 2009), 
genotyping of single nucleotide polymorphisms (Qi, et al., 2001), whole genome analysis of 
noncultivable viruses (Johne et al., 2009), detection and identification of circular plasmids in 
zoonotic pathogens (Xu et al., 2008), and for the description of new metagenomes (Lopez- 
Bueno et al., 2009). Recently we have achieved improvements of isothermal MDA by fusing 
DNA binding domains to the C-terminus of 4>29 DNApol (de Vega et al., 2010). The results 
showed that the addition of Helix-hairpin-Helix domains increases DNA binding of the hybrid 
DNA polymerases without hindering their replication rate. In addition, the chimerical DNA 
polymerases displayed an improved DNA amplification efficiency on both circular plasmids 
and genomic DNA and are unique 4*29 DNApol variants with enhanced amplification 
performance. These chimerical DNA polymerases will contribute to make (])29 DNA 
amplification technology one of the most powerful tools for genomics, consolidating MDA 
technology as the alternative to PCR for many applications. 
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Despite the exhaustive mutational analyses carried out throughout 4>29 DNApol, they did 
not provide a structural rationale for both the intrinsic processivity and strand displacement 
capacity of the enzyme. Instead, resolution of the cj)29 DNApol structure gave the insights 
into these two unique properties of the enzyme. These structural studies, carried out in 
collaboration with Tom Seitz's lab (Yale University), showed 4>29 DNApol formed by an N- 
terminal exonuclease domain, containing the 3'-5' exonuclease active site, and a C-terminal 
polymerisation domain that, like in other DNA polymerases, is subdivided into the 
universally conserved palm (containing the catalytic and DNA ligand residues), fingers 
(containing the dNTP ligands) and thumb (which confers stability to the primer) 
subdomains (Kamtekar et al., 2004) (see Figure 2A). 3D-structural comparisons indicated 
that the main difference between other family B DNA polymerases and (|)29 DNApol was 
the presence in the latter of two additional subdomains, both corresponding to sequence 
insertions that we had previously identified as specifically conserved in the protein-primed 
subgroup of DNA polymerases, called TPR1 and TPR2 (Blasco et al., 1990; Dufour et al., 
2000). TPR1 lies at the edge of the palm, while TPR2 contains a P-hairpin structure just 
facing the apex of the thumb subdomain. Palm, thumb, TPR1, and TPR2 subdomains form a 
doughnut-shaped structure that encircles the upstream duplex DNA at the polymerisation 
active site (Berman et al., 2007), constituting an internal clamp that provides the enzyme 
with the maximal DNA-binding stability required for its intrinsic processivity, mimicking 
and making unnecessary the sliding clamp used in other replisomes. On the other hand, 
TPR2, palm and fingers subdomains, together with the exonuclease domain, surround the 
downstream template strand (Berman et al, 2007), forming another tunnel whose narrow 
dimensions (~10 A) do not allow dsDNA binding. Thus, downstream dsDNA has to be 
unwound to enable threading of the template strand through this tunnel to reach the 
polymerisation site, using the same topological mechanism as hexameric helicases to open 
dsDNA regions, and providing a structural basis for the strand displacement capacity of 4>29 
DNApol (Kamtekar et al., 2004; Rodriguez et al., 2005). 

3D resolution of 4>29 DNApol structure also gave us the clues about how primer-terminus 
switches between polymerisation and exonuclease active sites during proofreading of 
polymerisation errors. Comparison of the structures of many apo DNA polymerases with 
their corresponding binary complexes showed that the major conformational changes occur 
mainly in their thumb subdomains, composed of two microdomains with a clear helicoidal 
character linked by a flexible region (Beese et al., 1993; Doublie et al., 1998; Eom et al., 1996; 
Franklin et al., 2001; Li et al., 1998; Shamoo & Steitz, 1999). Conversely, the 4>29 DNApol 
thumb subdomain has an unusual structure since it is small and mainly constituted by a 
long |3-hairpin without identifiable microdomains (Kamtekar et al., 2004). Moreover, 
comparison of the apo enzyme with the binary complexes showed that the thumb 
subdomain does not rotate upon DNA binding (Berman et al., 2007). We have shown that 
the prevention of a potential thumb movement by crosslinking the tips of the TPR2 and 
thumb subdomains did not affect the partitioning of the primer-terminus between the 
polymerisation and editing active sites (Rodriguez et al., 2009). The impeded motion of the 
TPR2 subdomain suggests that rotation of the DNA is not required to transfer the primer- 
terminus between the polymerisation and editing active sites in 4>29 DNApol, most likely as 
there is not any structural barrier in between. Then, how does the frayed terminus travel to 
the exonuclease active site? Considering the tj>29 DNApol thumb subdomain as a nearly 
static structure, the primer switching would be accomplished by a passive diffusion of the 
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frayed primer-terminus. The energetically unfavourable gradual melting of three-four base 
pairs should be progressively offset by new and specific interactions established with DNA 
ligands of the thumb subdomain, as suggested (Perez-Arnaiz et al., 2006). Such interactions 
would also channel the primer-terminus in the appropriate orientation to contact with 
ssDNA ligands of the exonuclease domain responsible for the stabilisation of the primer- 
terminus at the editing active site (de Vega et al., 1996, 1998b; Kamtekar et al., 2004; Perez- 
Arnaiz et al., 2006; Rodriguez et al., 2009). 
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Fig. 2. (A) 4>29 DNApol-DNA complex. (B) Structure of the 4>29 DNApol-Terminal Protein 
complex. Reproduced with permission from Kamtekar, S., Berman, A.J., Wang, ]., Lazaro, 
J.M., de Vega, M., Blanco, L., Salas, M. & Steitz, T.A. (2006). The phi29 DNA 
polymerase:protein-primer structure suggests a model for the initiation to elongation 
transition. EMBO ]. Vol. 25, No. 6, pp. 1335-1343. 
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Additionally, recent resolution of t|>29 DNApol tertiary complex structures has allowed us to 
dissect the subtle changes in the polymerisation active site that take place upon dNTP 
binding, providing the structural basis for the mechanism of translocation. Thus, once the 
catalysis of the phosphoester bond formation between the a-phosphate of the incoming 
dNTP and the OH- group of the priming nucleotide takes place, the pyrophosphate 
produced leaves the DNA polymerase, breaking the electrostatic crosslink that kept the 
fingers subdomain in the closed state. Concomitantly to the fingers opening, residues Tyr 254 
and Tyr 390 move back into the nucleotide insertion site, leading to one position backwards 
translocation of the nascent base pair out of the binding pocket, as now the nucleotide 
insertion site is sterically inaccessible (Berman et al., 2007). This translocation allows the 3' 
OH-group of the newly added nucleotide to be in a competent position to attack 
nucleophylically the a-phosphate of the incoming nucleotide during the next nucleotide 
insertion event (Berman et al., 2007) (see Figure 3). 



D249 


Fig. 3. Comparison of the binary (yellow) and ternary (green) complex structures of 4)29 
DNApol. The mechanistically significant amino acid movements are indicated. Reproduced 
with permission from Berman, A.J., Kamtekar, S., Goodman, J.L., Lazaro, J.M., de Vega, M., 
Blanco, L., Salas, M. & Steitz, T.A. (2007). Structures of phi29 DNA polymerase complexed 
with substrate: the mechanism of translocation in B-family polymerases. EMBO J. Vol. 26, 
No. 14, pp. 3494-3505. 

2.2.2 4>29 Terminal protein 

As already mentioned, the primer TP forms a heterodimer with the DNA polymerase for 
recognition and further initiation of TP-primed DNA replication. To discriminate between 
the two different functions, the TP molecule linked to the 5' DNA ends is called parental TP 
and the TP present in the complex with DNA polymerase is called primer TP. 
Crystallographic resolution of the structure of 4>29 DNApol/primer TP heterodimer has 
shown that the TP has an elongated three-domain structure (Figure 2B) (Kamtekar et al.. 
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2006). The N-terminal domain (residues 1-73) is structurally disordered likely because it is 
not interacting with the polymerase (Kamtekar et al., 2006; Perez-Arnaiz et al., 2007). The 
intermediate domain (residues 74-172) contains two long a-helices and a short P-turn-P 
structure and makes extensive contacts with the TPR1 subdomain of the polymerase. This 
interface has many charged residues and includes two salt bridges between arginine 
residues in the TP and glutamic acid residues in the TPR1 subdomain (R158:E291; 
R169:E322). It is connected through a hinge region to the C-terminal priming domain 
(residues 173-266), a region highly electronegative that has a four-helix bundle topology. 
Ser 232 , which provides the priming hydroxyl group for DNA synthesis, lies in a disordered 
loop (residues 227-233) at the end of the priming domain close to the active site of the DNA 
polymerase (see Figure 4, left panel). The priming domain structure shows interactions 
between many of their acidic residues and positively charged residues of the thumb 
subdomain of the polymerase (e.g., between E191:K575 and D198:K557), with residue R96 of 
the exonuclease domain and with TPR2 subdomain residues (Kamtekar et al., 2006; 
Rodriguez et al., 2004). Thus, the upstream duplex DNA "tunnel" of 4>29 DNApol encircles 
the TP priming domain whose overall dimensions and its negative charge mimics DNA in 



Fig. 4. Left, placement of TP priming residue Ser 232 (in grey) and penultimate template 
nucleotide at the 029 DNApol active site (catalytic aspartates are shown in red). Right, flexible 
orientations of TPR1 loop in the apoenzyme (coloured in magenta) and its stable and moved 
out structural conformation shown in the DNA polymerase/TP complex (coloured in orange). 
TP is coloured in yellow. Green arrows indicate the suggested conformational changes of both, 
the DNA polymerase TPR1 loop and the TP priming domain to allow the formation of a stable 
heterodimer. Reproduced with permission from Perez-Arnaiz, P., Longas, E., Villar, L., Lazaro, 
J.M., Salas, M. & de Vega, M. (2007). Involvement of phage 4>29 DNA polymerase and terminal 
protein subdomains in conferring specificity during initiation of protein-primed DNA 
replication. Nucleic Acids Research. Vol. 35, No. 21, pp. 7061-7073. 
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its interactions with the polymerase during initiation (de Vega et al, 1998a; Kamtekar et al., 
2006). This explains why DNA synthesis by the heterodimer cannot begin at internal sites of 
the phage genome, as the upstream 3' template would sterically clash with the TP 
(Kamtekar et al., 2006). 

Our previous studies showed that 4>29, GA-1 and Nf DNA polymerases display a great 
specificity for their corresponding primer TPs, as the heterologous systems did not give any 
detectable initiation product (Gonzalez-Huici et al., 2000a; Longas et al., 2006). By means of 
chimerical proteins, constructed by swapping the priming domain of the related 4>29 and 
GA-1 TPs, we showed that DNA polymerase can form catalytically active heterodimers 
exclusively with that chimerical TP containing the N-terminal and intermediate domains of 
the homologous TP, suggesting that the interaction between the polymerase TPR1 
subdomain and the TP intermediate domain is the one main responsible for the specificity 
between both proteins (Perez-Arnaiz et al., 2007). 

In addition, the independent expression of the 4>29 TP priming domain and intermediate 
plus N-terminal domains showed that the former can only prime initiation in the presence 
of the latter that assists the TP-dAMP formation most probably by inducing a 
conformational change in the DNA polymerase (Perez-Amaiz et al., 2007). The structure of 
the ((>29 DNApol forming a complex with the TP is very similar to that of the apo enzyme, 
the main conformational changes being restricted to TPR1 residues 304-315 (Kamtekar et al., 
2006). Such residues form loops with a high degree of flexibility in the apo enzyme. By the 
contrary, the 4>29 heterodimer structure shows that this loop moves out to allow the TP to 
access the polymerase active site. Altogether, these results led to propose a model for the 
DNA polymerase-TP interaction in which the TP intermediate domain would recognise 
specifically and interact with the DNA polymerase TPR1 subdomain. Such interaction 
would promote the change of the TPR1 loop from a flexible to the stable moved out 
conformation that now would allow the proper (prone to catalysis) placement of the TP 
priming domain into the DNA polymerase structure (Perez-Arnaiz et al., 2007) (see Figure 4, 
right panel). 

2.2.3 Recognition of replication origins by the DNA polymerase/TP heterodimer 

The cf>29 DNApol/TP heterodimer recognises the replication origins at the genome ends (see 
Figure 1). Blunt-ended DNA fragments containing the left or right 4>29 DNA ends, but not 
internal 4>29 DNA fragments, were active as templates in in vitro initiation reactions (Garcia 
et al., 1984; Gutierrez et al., 1986a; Gutierrez et al., 1986b). However, the activity was 6- to 
10-fold lower than that obtained with TP-DNA (Gutierrez, et al. 1986a; Gutierrez et al., 
1986b). These results indicated on the one hand, that specific DNA sequences located at the 
4>29 DNA ends are involved in origin recognition and on the other hand, that the parental TP 
is a major signal in the template for such a recognition, strongly suggesting that the 
heterodimer is recruited to the origin through interactions with the parental TP. In 
agreement with this, detection of initiation activity by using heterologous systems in which 
DNA polymerase, primer TP and TP-DNA came from 4>29 and Nf related phages, showed 
that initiation was selectively enhanced when the DNA polymerase and the TP-DNA were 
from the same phage, implying a specific interaction between DNA polymerase and 
parental TP (Gonzalez-Huici et al., 2000a). In line with this, a chimerical ((>29 DNApol 
containing the GA-1 DNA polymerase TPR1 subdomain was capable of catalysing the 
initiation reaction primed by GA-1 TP but solely in the presence of 4>29 TP-DNA, supporting 
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the hypothesis that a major contribution to the parental TP recognition is carried out by the 
DNA polymerase (Perez-Arnaiz et al, 2007). Similarly, mutations introduced at several TP- 
intermediate domain residues rendered TP mutants that could not support DNA replication 
when they acted as parental TP, indicating also a contribution of the primer TP in the 
specific recognition of the replication origins (Illana et al., 1998; Serna-Rico et al., 2000). 
Furthermore, measurement of the ability of the different DBPs coming from <j>29, Nf and 
GA-1 bacteriophages to activate homologous and heterologous replication origins showed 
also a specific recognition of each nucleoprotein complex by the homologous DNA 
polymerase/TP heterodimer (Freire et al., 1996). The fact that i/29 DBP stimulates the 
initiation activity of the heterodimer formed by GA-1 primer TP and a chimerical (j)29 
DNApol containing the TPR1 subdomain of GA-1 DNA polymerase to a similar extent as 
that of the cj>29 heterodimer, favours the hypothesis of a main and specific recognition of the 
DBP by the DNA polymerase (Perez-Arnaiz et al., 2007). 

2.2.4 A sliding-back mechanism for protein-primed DNA replication 

As already indicated, the DNA ends of 4>29 and those of the (f>29-related phages have a 
reiteration of three nucleotides (3'-TTT.... 5'). Once the replication origins are specifically 
recognised by the heterodimer formed by the DNA polymerase and the primer TP (Blanco 
et al., 1987; Freire et al., 1996; Gonzalez-Huici et al., 2000a; Gonzalez-Huici et al., 2000b; 
Perez-Arnaiz et al., 2007), the DNA polymerase catalyses the formation of a phosphoester 
bond between the initiator dAMP and the hydroxyl group of Ser 232 of the TP (see Figure 1), a 
reaction directed by the second T at the 3' end of the template strand (Mendez et al., 1992) 
and performed by the same catalytic residues responsible for canonical polymerisation 
(Blanco & Salas, 1995, 1996). Modelling of an incoming dNTP and a template strand onto 
the <j>29 DNApol/primer TP complex shows that the priming Ser 232 of TP is placed at the 
catalytic site of the DNA polymerase in line to attack nucleophylically the a-phosphate of 
the incoming nucleotide to form the phosphoester bond (Figure 4, left panel). The model 
also suggests that the 3' end of the template strand goes deep into the catalytic site of the 
DNA polymerase through the downstream template strand tunnel until it positions the 
penultimate 3' dTMP of the template strand at the catalytic site, allowing it to direct 
insertion of the initiator dAMP. To perform TP-DNA full-length synthesis, the TP-dAMP 
initiation product translocates backwards one position to recover the template information 
corresponding to the first 3'-T, the so-called sliding-back mechanism that requires a terminal 
repetition of 2 bp. This reiteration permits, prior to DNA elongation, the asymmetric 
translocation of the initiation product, TP-dAMP, to be paired with the first T residue 
(Mendez et al., 1992) (see scheme in Figure 5). 

Our studies have shown how the sliding-back mechanism, or variations on it, seems to be a 
common feature of protein-priming systems to restore full-length DNA. Thus, in the case of 
the <fi29-related phage GA-1, initiation also occurs at the 3' second nucleotide of the template 
(3'-TTT) and, to a lesser extent, at the third nucleotide (Illana et al., 1996). The (j>29-related 
phage Nf and the Streptococcus pneumoniae phage Cp-1 initiate at the 3' third nucleotide of 
their terminal repetition (3'-TTT) (Longas et al., 2008; Martin et al., 1996b), whereas the E. 
coli phage PRD1 initiates at the fourth nucleotide (3'-CCCC) (Caldentey et al., 1993), 
requiring two and three consecutive sliding-back steps, respectively, to recover the DNA 
end information (stepwise sliding-back). The adenovirus genome ends present a more 
complex reiteration (3'-GTAGTA), the 3' fourth to six template positions directing the 
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formation of the TP-CAT initiation product. Thus, recovery of the 3' ends is performed by a 
single jump, after which TP-CAT is paired with the terminal 3'-GTA (jumping-back) (King & 
van der Vliet, 1994) (see scheme in Figure 5). 
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Fig. 5. Sliding-back (jumping-back) model for the transition from initiation to elongation. TP 
is represented as a pink oval and DNA polymerase as a grey square. The internal template 
nucleotide that directs the insertion of the initiator nucleotide is shown in bold red letter. 
Yellow box represents the catalytic active site of the DNA polymerase. 
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2.2.4.1 Why sliding-back during genome replication? 

Protein-primed initiation can be predicted to be an inaccurate reaction. The insertion fidelity 
of protein-primed initiation in 4>29 has been shown to be quite low, the insertion 
discrimination factor being about 10 2 . Even more, the 3'-5' exonuclease activity of 4>29 
DNApol is unable to act on the TP-dNMP initiation complex, precluding the possibility that 
a wrong dNMP covalently linked to TP could be proofread (Esteban et al., 1993). If 
misincorporation during the initiation reaction takes place, a mismatch would be produced 
with the terminal 3-T on the template after the sliding-back of the incorrect TP-dNMP 
complex, unfavouring the further elongation step. If an incorrect TP-dNMP product were 
elongated it would be corrected in the next round of replication, because it would not serve 
as a template (Esteban et al, 1993). Thus, the sliding-back and its variations are envisaged to 
increase the fidelity during the initiation reaction, as several base pairing checking steps 
have to occur before elongation of the initiation product takes place (King & van der Vliet, 
1994; Mendez et al, 1992). The fact that other TP-containing genomes also contain some 
kind of sequence repetitions at their ends supports the hypothesis that the sliding-back type 
of mechanism could be a common feature of protein-primed replication systems (Mendez et 
al., 1992). 

2.3 Transition from protein-primed to DNA-primed replication 

Our functional analyses established that the 4>29 DNApol/primer TP heterodimer do not 
dissociate immediately after initiation or after sliding-back (Mendez et al., 1997). The same 
DNA polymerase molecule incorporates 5 nucleotides to the primer TP while is still 
complexed with the latter (initiation mode), undergoes some structural change during 
incorporation of nucleotides 6-9 (transition) and finally dissociates from the primer TP when 
nucleotide 10 is incorporated into the nascent DNA chain (elongation mode) (Mendez et al., 
1997). These results probably reflect the polymerase requirement for a DNA primer of a 
minimum length to catalyse DNA elongation efficiently. 

We have shown that the strength of the cj)29 DNA pol-primer TP interaction is differently 
contributed by the TP priming and intermediate domains (Perez-Arnaiz et al., 2007), 
supporting the model proposed for the transition from the protein-primed initiation to the 
DNA-primed elongation modes (Kamtekar et al., 2006). Thus, the TP intermediate domain 
would be in a fixed orientation on the polymerase by means of stable contacts with the TPR1 
subdomain. The weak interaction observed with the DNA polymerase would facilitate the 
TP priming domain to rotate following the helicoidal pathway as DNA is synthesized. The 
relative motion of the TP priming domain with respect to the fixed TP intermediate domain 
would be possible due to the flexibility of the hinge region that connects both domains. 
After incorporation of 6-7 nucleotides the proximity of the priming Ser to the hinge region 
would impede a further priming domain rotation, promoting complex dissociation 
(Kamtekar et al., 2006) (see Figure 6). 

2.4 DNA-primed elongation 

Once the initiation, sliding-back and transition steps have been fulfilled and 4>29 DNApol 
has separated from the primer TP, the DNA polymerase resumes TP-DNA replication; 
therefore, the same DNA polymerase molecule accounts for complete genome replication 
from a single binding event (Blanco et al., 1989). As mentioned before, the high stability of 
the 4>29 DNApol/DNA complex, by virtue of the "internal sliding-clamp-like" structure 
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Fig. 6. A model for the transition from initiation of replication to elongation (see main text 
for details). Reproduced with permission from Kamtekar, S., Berman, A.J., Wang, ]., Lazaro, 
J.M., de Vega, M., Blanco, L., Salas, M. & Steitz, T.A. (2006). The phi29 DNA 
polymerase:protein-primer structure suggests a model for the initiation to elongation 
transition. E MBO J. Vol. 25, No. 6, pp. 1335-1343. 

formed by thumb, palm, TPR1 and TPR2 subdomains, allows the polymerase to perform 
complete DNA replication without the assistance of processivity factors, in contrast to most 
replicative DNA polymerases. In addition, the singular TPR2 subdomain enables the c)>29 
DNApol to couple polymerisation to the unwinding of the downstream dsDNA regions 
(strand displacement capacity) making unnecessary the intervention of a helicase-like 
protein (Blanco et al., 1989). 

As mentioned above, since replication starts at both 4>29 DNA ends and is coupled to strand 
displacement, this results in the generation of so-called type-I replication intermediates (see 
Figure 1). The ssDNA stretches generated are bound by the viral SSB, essential for elongation 
of replication in vivo (Mellado et al., 1980). Binding of 4>29 SSB to cj)29 DNA replicative 
intermediates has been demonstrated to occur in vitro (Gutierrez et al., 1991b). The protein 
binds in a cooperative way (Soengas, et al., 1994) stimulating dNMP incorporation during <|)29 
DNA replication (Gutierrez et al., 1991a), and increasing the elongation rate, mainly when 4)29 
DNApol mutants impaired in strand displacement are used, probably due to the helix 
destabilising activity of the 4>29 SSB (Soengas et al., 1995). When the two converging DNA 
polymerases merge, a type-I replication intermediate becomes physically separated into two 
type-II replication intermediates (Gutierrez et al., 1991b; Inciarte et al., 1980). Continuous 
elongation by the DNA polymerase completes replication of the parental strand. 
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2.5 Termination of TP-DNA replication 

For termination of genome replication, the 4>29 DNApol encounters a covalently linked TP 
molecule (see Figure 7, upper panel), terminating DNA replication by an as yet unknown 
mechanism. As already indicated, the TPR2 insertion of the 4>29 DNApol, together with the 
exonuclease domain and the fingers and palm subdomains form a downstream tunnel 
whose narrow dimensions precludes the passage of a dsDNA through it. Considering that 
the terminal base of the template is covalently linked to the Ser 232 of TP, the priming loop of 
the parental TP has to enter the downstream template-binding tunnel of 4>29 DNApol to 
allow the last 5'-dAMP (covalently linked to the parental TP) to be replicated. However, the 
dimensions of such a tunnel do not allow the TP to pass through it to reach the catalytic 
active site, so a disruption of the interactions of TPR2 with the exonuclease domain and the 
thumb subdomain of the polymerase is required to get an opened tunnel (see Figure 7, lower 




Fig. 7. Hypothetical partial opening of the TPR2 and thumb subdomains to allow the last 5'- 
template nucleotide to reach the polymerisation active site of the DNA polymerase. 
Template and primer strands are coloured in blue and magenta, respectively. 
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panel). By using primer/5'-streptavidine-bound template DNA to mimic the parental TP, 
we showed that all but 4 terminal nucleotides were replicated by the 4>29 DNApol (de Vega 
et al., 1999). This situation could be parallel to the physiological one, in which <\>29 DNApol 
has to finish the replication of a DNA template strand that contains a TP molecule 
covalently linked at its 5' end. It is tempting to speculate that during termination of 
replication, a specific interaction between DNA polymerase and TP is required to allow 
synthesis of the terminal nucleotide. 

2.6 Terminal protein-primed DNA amplification 

By using appropriate amounts of the four 4>29 DNA replication proteins described above, 
primer TP, DNA polymerase, the double-stranded DNA-binding protein, and the single- 
stranded DNA-binding protein, we were able to amplify limited amounts of the 4>29 TP- 
DNA molecule by three orders of magnitude after 1 hr of incubation at 30 °C. Moreover, the 
quality of the amplified DNA was demonstrated by transfection experiments, in which 
infectivity of the synthetic (amplified) cj)29 TP-DNA, measured as the ability to produce 
phage particles, was identical to that of the natural cj)29 TP-DNA obtained from virions 
(Blanco et al., 1994), leading us to establish some of the requisites for the development of 
isothermal DNA amplification strategies based on the 4>29 DNA replication machinery to 
amplify very large (> 70 kb) segments of exogenous DNA. 

3. In vivo compartmentalisation of <j>29 DNA replication 

It is well established that replication of phage genomes occurs at specific intracellular 
locations by the use of large organising structures that bring together replication factors to 
enhance the efficiency of the replication process. Some lines of evidence support that 
replication of phage DNA takes place in close association with the bacterial membrane 
(Firshein, 1989; Mosig & Macdonald, 1986; Siegel & Schaechter, 1973). Our recent 
investigations have also given evidences concerning compartmentalization of phage (]>29 
DNA replication in B. subtilis cells. 

We have shown that cj)29 TP binds to dsDNA through its N-terminal domain in a non¬ 
sequence dependent way, both in vitro and in vivo (Munoz-Espin et al, 2010; Zaballos & Salas, 
1989). This capacity enables the parental TP, and therefore the viral TP-DNA, to associate with 
the bacterial nucleoid early after injection of the 4>29 genome (see scheme in Figure 8) where 
the B. subtilis RNA polymerase is also located (Munoz-Espin et al., 2010). There, synthesis of 
the 4>29 early proteins DNA polymerase, primer TP, SSB and DBP, essential for in vivo 4>29 
DNA replication takes place. Once synthesized, primer TP binds the bacterial chromosome 
and recruits the DNA polymerase to form the heterodimer that will recognise TP-DNA 
replication origins. At this stage, replication of TP-DNA will start from both terminal origins 
giving rise to the replicative intermediates type I and II, as it has been observed by electron 
microscopy (Harding & Ito, 1980; Inciarte et al., 1980; Sogo et al., 1982) (see scheme in Figure 8). 
By fluorescence microscopy, we have shown that at middle infection times, the DNA 
polymerase, TP, and viral TP-DNA are reorganised adopting a peripheral helix-like 
distribution toward the poles of the cell (Munoz-Espin et al., 2009; Munoz-Espin et al., 2010). 
Although the pathway followed by the cj)29 replicative machinery from the nucleoid to 
bacterial peripherical regions remains to be determined, it has been proposed that it could 
travel associated to the replication of bacterial chromosome (Munoz-Espin et al., 2010), as 
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newly synthesized bacterial DNA is translocated towards the cell poles via a helical 
structure (Berlatzky et al., 2008). 

A B C D E 




Fig. 8. Model of nucleoid-associated early 4>29 DNA replication organised by the TP. (A) 
Attachment of 4>29 TP-DNA to the bacterial nucleoid surface (gray mass at bottom) through 
the N-terminal domain (red) of the parental TPs (red and green). (B) Recruitment of the 4>29 
DNApol/primer TP heterodimer to the replication origins of TP-DNA. (C) Processive 
elongation of the nascent DNA strands (red lines) coupled to strand displacement. (D and E) 
Once DNA replication is completed, two 4>29 TP-DNA molecules are ready for another 
round of replication. For simplicity, other viral proteins involved in DNA replication are not 
drawn. Reproduced with permission from Munoz-Espin, D., Holguera, I., Ballesteros-Plaza, 
D., Carballido-Lopez, R. & Salas, M. (2010). Viral terminal protein directs early organisation 
of phage DNA replication at the bacterial nucleoid. Proceedings of the National Academy of 
Sciences of USA. Vol. 107, No. 38, pp. 16548-16553. 

The 4>29 membrane protein pl6.7 has a non-sequence specific DNA binding capacity (Meijer 
et al., 2001) that enables it to interact with the 4>29 replication origins through recognition of 
the parental TP (Serna-Rico et al., 2003). This protein also shows a helix-like pattern at the 
membrane of infected cells, most probably being involved in the compartmentalization of in 
vivo membrane-associated 4>29 DNA replication through a direct contact with TP-DNA, 
organising the viral replicating intermediates at numerous peripheral locations (Albert et al., 
2005; Munoz-Espin et al., 2009) (see Figure 9). Different experimental approaches have 
demonstrated that protein pl6.7 interacts directly with the B. subtilis actin-like cytoskeleton 
protein MreB (Munoz-Espin et al., 2009). This protein forms helix-like filamentous structures 
in vivo essential for the control of the bacterial rod-shaped morphology (Jones et al., 2001), 
suggesting that MreB would contribute to efficient <j)29 DNA replication by recruiting 
protein pl6.7 to the appropriate sites at the cell membrane allowing simultaneous 
replication of multiple templates at numerous peripheral locations. Further evidence is the 
finding that 4>29 DNA replication is severely affected in AMreB cytoskeleton mutants 
(Munoz-Espin et al., 2009). 

<t>29 gene 1 codes for a small protein (pi) that assembles into long protofilaments forming 
bidimensional sheets (Bravo & Salas, 1998) in association with the bacterial membrane in 
vivo (Serrano-Heras et al., 2003). Cell fractionation studies indicated that protein pi is 
membrane-associated both during synthesis of cj)29 DNA and after blocking <J)29 DNA 
replication (Bravo & Salas, 1997). Membrane-association of pi also occurs in the absence of 
other viral components, suggesting that protein pi contacts the bacterial membrane directly 
(Bravo & Salas, 1997; Serrano-Heras et al., 2003). Phage e|>29 DNA replication was shown to 
be significantly reduced when non-suppressor B. subtilis cells were infected with mutant 
phage susl(629) at 37°C (Bravo & Salas, 1997; Prieto et al., 1988). In addition, protein pi was 
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Fig. 9. Model of membrane-associated late (f2.9 DNA replication organised by the MreB 
cytoskeleton. MreB, Mbl, and MreBH are shown to form a putative triple helical structure 
closely associated with the inner surface of the membrane. Each dimeric unit of protein pl6.7 
is represented by a yellow hexagon. The tridimeric pl6.7 units form oligomers in a helix-like 
localisation at the cell membrane. For simplicity, other viral proteins involved in DNA 
replication are not drawn. Reproduced with permission from Munoz-Espin, D., Daniel, R., 
Kawai, Y., Carballido-Lopez, R., Castilla-Llorente, V., Errington, ]., Meijer, WJ & Salas, M. 
(2009). The actin-like MreB cytoskeleton organises viral DNA replication in bacteria. 

Proceedings of the National Academy of Sciences of USA. Vol. 106, No. 32, pp. 13347-13352. 

also shown to interact with the viral TP in vitro (Bravo et al., 2000). These results suggest that 
protein pi is a component of a membrane-associated structure which would play, in 
addition to pl6.7, a role in the organisation of 4>29 DNA replication by providing an 
anchoring site for the replication machinery. 

4. <(>29 codes for a B.subtilis uracil-DNA glycosylase inhibitor. 

Downstream to 4>29 gene 1 there is an open reading frame (ORF56) encoding an acidic 
protein of 56 amino acids (protein p56) that shows a dimeric state in solution and that 
accumulates throughout the 4>29 infective cycle (Serrano-Heras et al., 2006). Chemical 
crosslinking assays showed that viral p56 interacts, both during the infective process and in 
the absence of viral components, with B. subtilis uracil-DNA glycosylase (UDG), a key 
enzyme that eliminates uracil residues from the DNA during the Base Excision Repair (BER) 
pathway (Serrano-Heras et al., 2006). In vitro assays demonstrated that such an interaction 
inhibits UDG. In agreement with this, extracts from (|)29-infected cells showed a dramatic 
drop in bacterial UDG activity, in contrast to the nearly 90% of the activity that remained 
after incubation with extracts from non-infected cells. 

4.1 Role of protein p56 in <|>29 DNA replication 

Inhibition of the cellular UDG activity after phage infection was established previously in 
two systems. The first was the inhibition of the host UDG after infection of B. subtilis with 
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the uracil-containing phage PBS2 (Friedberg et al, 1975). In this case, inhibition of cellular 
UDG was crucial to prevent the elimination of uracils from the viral genome. Similarly, 
bacteriophage T5 inhibits E. coli UDG, although the physiological role of this inhibition is 
still unclear, as its genome does not contain uracils (Warner et al, 1980). 

Both, in vitro and in vivo assays have recently demonstrated the ability of 4>29 DNApol to 
insert, extend and use as template dUMP residues with nearly the same efficiency that 
dTMP to give full-length DNA (Serrano-Heras et al., 2008). Whereas removal by the UDG of 
the uracil residues incorporated into the phage genome caused a drastic reduction in the 
efficiency of 4>29 DNA replication, as 4>29 DNApol is unable to use as template the resulting 
abasic sites (de Vega & Salas, unpublished results), such an inhibitory effect caused in 
replication by UDG was counteracted by the addition (in vitro) or expression (in vivo) of 
protein p56 (Serrano-Heras et al., 2008). 

As in the case of bacteriophage T5, 4>29 TP-DNA does not contain uracils, then, why does 
phage <\>29 synthesize a UDG inhibitor? It has been suggested that this inhibition is related to 
the mechanism of 4>29 DNA replication. As illustrated in Figure 1, during 4>29 TP-DNA 
replication replicative intermediates (type-I and type-II) with long stretches of single- 
stranded DNA are generated. The presence of uracil in the replicative intermediates could 
recruit components of the cellular BER pathway, such as UDGs and AP endonucleases. 
Further removal of the uracil moiety would render an abasic site that cannot be used as 
template by the 4>29 DNApol. By the contrary, the AP site will be further recognised by the 
cellular AP endonuclease that would introduce a nick into the phosphodiester backbone 
with accompanying loss of the terminal DNA region, giving rise to shorter viral DNA 
molecules lacking one parental TP. Therefore, the action of the cellular UDG on single- 
stranded DNA regions of the 4>29 replicative intermediates would be harmful for viral 
replication (Serrano-Heras et al., 2008). 

5. Conclusions and future research 

The availability of an efficient in vitro 4>29 TP-DNA replication system, as well as the 
biochemical characterisation of the different proteins involved, have greatly contributed to 
lay the foundations of the different steps in the protein-priming mechanism of DNA 
replication. In this, a specific DNA polymerase catalyses the formation of a phosphoester 
bond between the initiator dNMP and the OH group of a specific residue in the TP. In vitro 
systems have been also developed for the replication of adenovirus and bacteriophages 
GA1, Nf, PRD1 and Cpl, showing that they use a similar protein-priming mechanism for 
the initiation of replication. Thus, the mechanism of initiation of t(i29 DNA replication can be 
extrapolated to other organisms containing a TP covalently linked to the 5'-ends of their 
genomes, as human hepatitis B virus, mitochondrial plasmids, linear chromosomes and 
plasmids of Streptomyces, as well as several virus infecting Archaea, as halovirus. 

Despite of the advances in the knowledge of the protein-priming process, several questions 
regarding the initiation at internal positions of the TP-containing genomes remain to be 
elucidated, as the basis for the specificity for the templating nucleotide. In this respect, the 
use of chimerical TPs, constructed by swapping the priming domains of the related 4>29 and 
Nf proteins, allowed us to conclude that this domain is one of the structural determinants 
that dictates the internal 3' nucleotide used as template during initiation. On the other hand, 
the backwards motion of the primer TP with respect to the fixed template molecule, implies 
a breakage of the pair TP-A:T (in the case of bacteriophages i/29, Cpl, Nf and GA-1) or, most 
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drastically the triple base pairing TP-CAT:GTA, as in the case of adenovirus. This 
energetically unfavoured step should be explained by a power stroke mechanism, by which 
the energy released after dissociation of the pyrophosphate could drive the backwards 
movement of the TP-dNMP initiation product with respect to the DNA polymerase and 
template strand with the consequent correct base pairing with the preceding 3' nucleotide, 
to reach a more energetically favoured situation. The elucidation of the conformational 
changes that govern the sliding-back mechanism will give the clues to understand such a 
special way to initiate genome replication. 

Termination of replication of TP-containing genomes is still an unresolved matter. We have 
shown that protein-primed DNA polymerases exhibit an exquisite specificity for their TP 
counterparts during the initiation step of TP-DNA replication. Based on the 4>29 DNApol 
structure, it seems obvious that to let the last 5'dAMP to be copied during termination, at 
least the parental TP priming loop of the template strand has to access the catalytic site of 
the polymerase following the downstream template tunnel pathway. The question that 
arises is whether there is also specificity between the DNA polymerase and the parental TP 
during termination, and how the DNA polymerase performs this step. 

We are starting to understand how replication of t()29 is organised and compartmentalised 
into the bacterium, acting in concert with cellular factors to increase the efficiency of this 
biological process. Thus, early after 4>29 infection TP-DNA replication takes place at the 
bacterial nucleoid by means of a non-specific interaction between the parental TP and the 
bacterial chromosome. Later during infection, the cj)29 DNA replication machinery is 
organised in peripheral helix-like structures through an interaction of the phage protein 
pl6.7, which recognises the dsDNA of replicating TP-DNA molecules, with the actin-like 
MreB cytoskeleton. 

In addition, phage 4>29 protein p56 is essential in the viral DNA replication cycle since it 
prevents the impairment caused by the host UDG. Inhibition of UDG has been proposed to 
be a defence mechanism developed by 4>29 to prevent formation of abortive replicative 
intermediates. This is the first case reported of an UDG inhibitor encoded by a non-uracil 
containing DNA. 

The mechanistic details of how B. subtilis MreB is specifically employed by phage cj)29, and 
how they are temporally and spatially organized may be main directions for future 
experiments. Since bacteriophages contain genomes with a limited size due to their small 
dimensions, it is expected that new bacterial proteins interacting with viral components may 
be discovered. A major challenge is to identify these novel targets that might be used by 
bacteriophages to optimize the production of high numbers of progeny. 
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1. Introduction 

Chromosome replication is a basic biological process that is essential for cellular 
proliferation. Owing to its fundamental nature the process of DNA replication is highly 
conserved among eukaryotic organisms. In the course of a typical somatic cell cycle, cells are 
"born" in G1 phase. Throughout this phase the cells grow in size increasing their mass and 
their protein synthetic capacity. As cells grow during G1 they monitor their size and protein 
synthetic capacity along with a variety of parameters in the external milieu including 
nutrient availability (Hartwell, 1974). When the internal and external environments are 
deemed to be suitable, the cell will commit to a cell division cycle. Initiating a cell cycle is a 
serious matter for yeast cells in that once they initiate the program they are committed to its 
completion (Hartwell, 1974). Should the cell have insufficient resources or capacity to 
successfully complete division the result is arrest and inevitable loss of viability. The 
decision to initiate a division cycle is taken at START in the budding yeast Saccharomyces 
cerevisiae; this event is referred to as R, the restriction point in mammalian cells (Hartwell, 
1974; Pardee, 1974). If conditions are appropriate for proliferation, G1 phase cells will make 
the transition to S-phase where DNA replication ensues followed by progression through 
G2 and chromosome segregation at mitosis followed by completion of the cycle at 
cytokinesis. 

Prior to completing the START transition and committing to a round of cell division S. 
cerevisiae can adopt several alternative fates. Haploid cells can conjugate with a partner of 
the opposite mating type to form diploids (Cross et al, 1988). Alternatively, both haploid 
and diploid cells are capable of entering a quiescent state in response to deprivation of some 
nutrients (Gray et al, 2004). Under the appropriate conditions diploid MATa /a cells can exit 
the mitotic cell cycle and initiate the meiotic differentiation program leading to sporulation 
(Kupiec et al., 1997). This differentiation program is triggered by starvation for nitrogen and 
glucose. Upon encountering these conditions diploid S. cerevisiae will arrest in G1 phase 
with unreplicated DNA. The meiotic differentiation process initiates from G1 phase where 
the starved cells accumulate, and progresses through premeiotic DNA replication followed 
by extensive homologous recombination. Meiotic recombination is followed by two 
consecutive rounds of chromosome segregation, the meiosis I (reductional) division, and 
meiosis II (equational) division. Significantly, these two rounds of chromosome division 
occur without an intervening S-phase hence the meiotic progeny are haploid and the 
haploid nuclei are encapsulated in spores that are highly resistant to environmental insult. 
Hence, with regard to chromosome metabolism, the meiotic program resembles a somatic 
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cell cycle but many aspects of the process are modified to achieve the desired developmental 
aim of ploidy reduction, increased genetic diversity, and spore formation. 

2. The transition to premeiotic S-phase 

In mitotically proliferating S. cerevisiae cells, progression past START and entry into S-phase 
is dependent upon the Cyclin Dependent kinase Cdkl (formerly known as Cdc28) (Reed, 
1992). Cdkl is activated in Gl-phase by the three G1 cyclins Clnl, Cln2, and Cln3 
(Richardson et al., 1989). The Cln-Cdkl complexes serve multiple functions in promoting 
entry into a mitotic cell cycle (Levine et al, 1995). Starvation for nitrogen and glucose causes 
diploid cells to arrest in G1 and precludes progression into a mitotic cell division cycle. 
Under these conditions the CLN1, CLN2 and CLN3 genes are repressed (Gallego et al, 1997). 
The CLN cyclins are not required for entry into or progression through meiotic 
differentiation and indeed enforced expression of CLN cyclins under starvation conditions 
impedes the initiation of meiotic differentiation (Dirick et al., 1998; Colomina et al., 1999). It 
has been proposed that CLN expression acts as the switch that determines whether a cell 
initiates a mitotic division cycle or enters meiotic differentiation (Colomina et al., 1999). 
Nitrogen deprivation, the condition that represses the expression of the CLN genes and 
precludes entry into a mitotic cell division cycle, induces the expression of a cascade of 
meiosis-specific genes (Kassir et al., 2003). MATa/a diploid cells respond to starvation for 
nitrogen and glucose by activating expression of IME1 (Inducer of Meiosis) (Kassir et al., 
1988). The induction of IME1 requires input from a complex signaling pathway that 
integrates nutritional signals (nitrogen and glucose) and ploidy MATa and MAT a mating 
type genes (Kassir et al., 2003). MATa/a diploids will respond to this condition by initiating 
the program of meiotic differentiation, whereas haploids, MATa/a or MATa/a diploids will 
arrest in G1 and enter a quiescent state. IME1 is a meiosis-specific transcription factor that 
activates the expression of a large family of early meiotic genes many of which encode 
proteins that function in DNA replication and recombination (Mitchell, 1994). One of the 
key targets of Imel protein is the IME2 gene (Smith et al., 1990). Ime2 encodes a meiosis- 
specific protein kinase with amino acid sequence similarity to Cdks. Indeed it has been 
proposed that Ime2 may replace the functions of Cln-Cdkl complexes in meiosis (Dirick et 
al., 1998). While Ime2 may share some of the roles played by G1 cyclins it is unlikely that 
Ime2 directly replaces Cdk function in meiosis (Honigberg, 2004). However, Ime2 does play 
many roles in the meiotic differentiation program including driving the induction of early 
and middle phase meiosis-specific genes, and promoting inactivation of both APC CDH1 
(Anaphase Promoting Complex) and the Cdk inhibitor Sicl (Dirick et al., 1998; Bolte et al., 
2002; Sedgwick et al., 2006; Holt et al., 2007). The latter two functions are essential to allow 
the accumulation of active B-type cyclin-Cdkl complexes as APC CDH1 targets the cyclins for 
proteolytic degradation and Sicl binds to the cyclin-Cdkl complexes and inhibits their 
catalytic activity. 

Imel and Ime2 are key regulators of the meiotic differentiation program. Loss of Imel 
function causes diploid cells to arrest in G1 phase in response to starvation, and precludes 
initiation of the meiotic program (Kassir et al., 1988; Smith et al., 1990). Mutations that 
inactivate Ime2 result in diploid cells undergoing a prolonged Gl-phase arrest in response 
to starvation followed by delayed and defective DNA replication that leads to loss of 
viability (Foiani et al., 1996). 
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3. Initiation of premeiotic DNA replication 

The initiation of DNA replication is a tightly regulated process that begins with the stepwise 
assembly of a protein complex that ultimately recruits a DNA helicase and the DNA 
polymerases required for replication of the chromosomal DNA (Remus & Diffley, 2009). 
Although meiotic differentiation differs from mitotic proliferation, in some cases drastically, 
the fundamental aspects of the premeiotic DNA replication mirror the processes that occur 
during mitotic proliferation. 

DNA replication initiates from specific chromosomal locations referred to as origins of 
replication (Ori). These sequences are distributed along each of the chromosomes so that 
DNA replication can initiate from multiple sites. The relatively compact Oris of S. cerevisiae 
have been well defined and have an essential core sequence 5'-TTTTATGTTTA-3' and a set 
of three less well conserved accessory sequences that stimulate the efficiency of Ori 
activation (Marahrens & Stillman, 1992; Marahrens & Stillman, 1994). Data from numerous 
investigations support the contention that origins of replication are activated with variable 
efficiency in mitotically proliferating cells. Additionally, not every Ori is activated in every 
S-phase (McCune et al., 2008; Patel et al., 2008). Thus, Oris that are activated more efficiently 
have a higher probability of being activated at higher frequency. 

Origin of DNA replication usage has been less rigorously investigated during premeiotic 
DNA replication. Analysis of origin activation by two-dimensional gel electrophoresis 
suggested that in S. cerevisiae the same origins that are activated during mitotic proliferation 
are also utilized during premeiotic DNA replication (Collins & Newlon, 1994). This study 
did not investigate the efficiency or frequency of origin activation. A more recent 
investigation of origin usage in the fission yeast Schizosaccharomyces pombe using microarray 
technology revealed that the same origins of DNA replication are used during mitosis and 
meiosis in this yeast (Heichinger et al., 2006). These authors found an overall reduction in 
the efficiency of origin utilization in premeiotic DNA replication relative to proliferation and 
they speculated that this might at least in part explain why premeiotic DNA replication 
takes so much longer than mitotic DNA replication (2 hours for premeiotic S-phase vs. 20 
minutes for mitotic S-phase in S. cerevisiae) (Cha et al, 2000; Heichinger et al., 2006). 
Transcription initiating near or proceeding through Ori sequences can influence the 
efficiency of origin utilization (Donato et al., 2006; Mori & Shirahige, 2007). Additionally, the 
state of histone modification, in particular acetylation, has been correlated with the 
efficiency of Ori assembly and activation (Vogelauer et al., 2002; Weber et al., 2008; 
Unnikrishnan et al., 2010). Some of these conclusions were reached by using chemical 
inhibition of histone deacetylases (HDACs) to alter the modification state of the chromatin. 
These conclusions are controversial since altering HD AC activity can affect many aspects of 
gene regulation and one effect of HD AC inhibition is to reduce the supply of pyrimidine 
nucleotide precursors for DNA replication resulting in reduced fork speed and the 
recruitment of latent origins of DNA replication: (Gay et al., 2010). 

Since transcription patterns and the landscape of protein binding to DNA changes 
dramatically when cells transition from mitotic proliferation to meiotic differentiation it is 
not surprising that there would be a change in the usage of origins. It is unclear why there is 
an overall reduction in origin firing in meiotic cells but one possibility is that that some 
factor required for origin firing is limited during meiosis. Owing to the fact that meiotic 
differentiation initiates under starvation conditions where there may be no exogenous 
source of amino acids or nitrogen and that the total number of ribosomes decreases by 50% 
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or more (Hopper et al., 1974; Magee & Hopper, 1974) it is likely that the overall rate of 
protein synthesis is reduced in cells undergoing meiosis (Esposito et al, 1969). Under these 
conditions labile factors such as Dbf4 may become limiting to origin activation (Patel et al., 
2008). During mitotic proliferation Ori activation is temporally regulated with a subset of 
origins being activated early and another subset being activated later in S-phase 
(Raghuraman & Brewer, 2010). Activation of these origins displays distinct properties, early 
firing origins are activated even in the presence of hydroxyurea which inhibits or reduces 
activation of late origins (Santocanale et al., 1999). S-phase initiation requires Cdkl, and the 
S-phase form of S. cerevisiae Cdkl is activated by the cyclins Clb5 and Clb6. Early origins are 
activated in the absence of the S-phase cyclin Clb5 whereas late activated origins depend 
upon Clb5 (Donaldson et al., 1998). This particular characteristic has been attributed to the 
fact that Clb6-Cdkl complexes are present early in S-phase but Clb6 is rapidly degraded as 
cells progress through S-phase (Jackson et al., 2006). The role of cyclin Cdk complexes in 
triggering DNA replication will be considered in section 5 of this chapter. The mechanisms 
governing this temporal pattern of origin activation have been speculated upon but have yet 
to be clarified. Recent investigations suggest that earlier firing origins are more efficient and 
have a higher probability of firing in any given cell cycle. It has not yet been determined 
whether temporal regulation is imposed upon Ori activation during premeiotic DNA 
replication. 

4. Assembling the prereplication complex 

DNA replication is initiated from specific origins of replication by assembling complex 
multiprotein machines on origins of replication and then activating those complexes. 
PreReplication Complex (PreRC) assembly at origins of DNA replication has been 
extensively investigated in mitotically proliferating cells (Diffley, 2001). Contributions from 
several laboratories have revealed that assembly of the complex proceeds in a step-wise 
fashion (Diffley et al., 1995). In yeast the Ori sequence is constitutively bound by the Origin 
Recognition Complex (Ore) composed or the proteins Orel - Orc6 (Rowley et al., 1995). In 
contrast the Ore proteins assemble on the origins in G1-phase in metazoans and Orel is 
destabilized as the cells pass through S-phase (DePamphilis, 2005). The Ore complex marks 
the chromosomal origins of DNA replication, and serves as a nucleation site for the 
assembly of a functional PreRC (Diffley et al., 1995; Rowley et al., 1995). 

A key cell cycle regulated step in promoting further assembly of the PreRC is the 
accumulation of Cdc6 and Cdtl in late G1 phase. Transcription of the Cdc6 gene is induced 
in late G1 phase by the MBF transcription factor (Zhou & Jong, 1990). In mitotically 
proliferating cells the accumulating Cdc6 in conjunction with Cdtl binds to Ore complexes 
tethered to Ori DNA (Devault et al., 2002). The ATPase activity associated with Cdc6 is then 
required to promote loading of the Mem (Mini Chromosome Maintenance) complex on to 
the ORC complex (Stillman, 2005). The loading of Mem is a key step in assembling the origin 
complex. Failure of this loading or loss of Cdc6 function precludes further assembly of 
PreRCs and blocks the initiation of DNA replication (Piatti et al., 1995). Surprisingly, loss of 
Cdc6 function does not arrest chromosome metabolism. Rather, cells depleted of Cdc6 
simply skip over S-phase and progress to a reductional chromosome division leading to loss 
of viability (Piatti et al., 1995). 

The Cdc6 gene is subject to regulated transcription during meiotic differentiation where it 
accumulates during G1 phase (Ofir et al., 2004). Cdc6 is essential for progression through 
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premeiotic S-phase and meiotic recombination in both yeast and higher eukaryotes 
(Lemaitre et al., 2002; Lemaitre et al., 2004; Ofir et al., 2004). Loss of Cdc6 in meiotic 
differentiation precludes the initiation of premeiotic DNA replication but allows an 
attempted chromosome division without having replicated the DNA or forming a 
synaptonemal complex, leading to loss of viability (Hochwagen et al., 2005). Current 
thinking ascribes progression to meiotic chromosome divisions in the absence of S-phase to 
the idea that progression through G1 - S - M phases is restrained by the S-phase checkpoint 
which detects single stranded DNA or some aspect of DNA replication fork structure. A 
failure to initiate DNA replication results in no signal that could be detected by the 
checkpoint and so the checkpoint is not activated. Under these conditions the transcriptional 
program that drives meiotic differentiation proceeds and the cells progress to attempt the 
segregation of unreplicated chromosomes (Stuart & Wittenberg, 1998). 

The primary role of Cdc6 is to promote the association of the hexameric Mem complex with 
DNA bound Ore complexes (Tanaka et al., 1997). The Mem proteins Mem 2-7 form sub¬ 
complexes that can subsequently be loaded on to the Ore complex as a dimer of hexamers 
(Stillman, 2005). The Mem complex functions as an ATP-dependent DNA helicase (Ishimi, 
1997). By loading a dimer of Mem hexamers on to Ore complexes the hexamers can drive 
unwinding of the duplex DNA leading to bidirectional DNA replication. In addition to their 
role in DNA unwinding, the Mem helicase also provides binding sites for the recruitment of 
other factors that form the replication forks such as DNA polymerases (Forsburg, 2004). 
Components of the Mem complex accumulate cooperatively in the nucleus during G1-phase 
and are required for activation not only of S-phase but also of the S-phase checkpoint (Labib 
et al., 2001). Surprisingly, it has been reported that Mem proteins are not required for 
premeiotic DNA replication in yeast (Forsburg & Hodson, 2000). This study made use of a 
set of yeast strains harboring temperature sensitive mutations in a subset of the MCM genes. 
The opposite conclusion was reached by investigators using temperature sensitive degron 
mutants that eliminated or profoundly reduced the abundance of the Mem proteins upon 
activation of the degron (Lindner et al., 2002). The difference between these two 
investigations may simply have been the degree to which the Mem proteins could be 
depleted from the cells. 

Subsequent to the loading of the hexameric Mem complex in late G1-phase, assembly of the 
PreRC continues with the recruitment of several additional factors Cdc45, Dpbll, Sld2, Sld3, 
and the GINS complex (Remus & Diffley, 2009; Tanaka & Araki, 2010). Cdc45 has no 
identified catalytic activity but it makes specific contacts with the Mem complex and is 
essential for mitotic DNA replication (Hopwood & Dalton, 1996). Cdc45 is essential for 
premeiotic DNA replication and meiosis in plants (Stevens et al., 2004), and it is assumed to 
be essential for premeiotic DNA synthesis in yeast although this has never been formally 
reported. 

Cdc45 binding to the Mem complex at the preRC is linked to binding of Dpbll, Sld2, and 
Sld3. Dpbll is thought to act as a scaffold to allow association of Sld2, Sld3 and the 
replicative DNA polymerases Poll (DNA polymerase alpha) and Pol2 (DNA polymerase 
epsilon) with the preRC (Masumoto et al., 2000; Tanaka et al., 2007; Zegerman & Diffley, 
2007). Sld2, Sld3, and Dpbll are all essential for the initiation of DNA replication in 
proliferating cells (Araki et al., 1995; Kamimura et al., 1998; Kamimura et al., 2001). Stable 
assembly of Sld2 and Sld3 with Dpbll and the preRC is dependent upon Cdk activity. 
Phosphorylation of Sld2 and Sld3 on multiple Cdk sites promotes their stable binding to 
Dpbll and the preRC (Tanaka et al., 2007; Zegerman & Diffley, 2007). Temperature sensitive 
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sld2, and mutants display a profound defect in premeiotic DNA replication implying that 
they serve the same essential role in meiotic cells that they play in proliferating cells (Figure 1). 


SLD2/SLD2 sld2-6/sld2-6 



Fig. 1. Flow cytometry profiles of SLD2/SLD2 and sld2-6/sld2-6 mutant cells that have been 
induced to initiate meiotic differentiation at the non-permissive temperature of 36°C. The 
sld2-6 mutant arrests in Gl-phase - early S-phase and does not complete S-phase. The sld2-6 
temperature sensitive allele was provided by Dr. H. Araki and was introduced into the S. 
cerevisiae SKI genetic background by gene replacement. 

The GINS complex is a tetrameric protein assembly whose name derives from Go-Ichi-Ni- 
San; the Japanese numbers 5,1,2,3, in reference to Sld5, Psfl, Psf2, Psf3, the components of 
the complex (Takayama et ah, 2003). GINS associates with Sld2, Dpbll, and Pol2 in a sub¬ 
complex that is loaded on to the preRC (Muramatsu et al, 2010). The GINS complex is 
essential for the initiation of DNA replication and replication fork progression in 
proliferating cells (Takayama et al v 2003). The function of GINS in premeiotic S-phase has 
not been directly investigated, however the finding that psf2 mutants display meiotic 
chromosome segregation defects in S. pombe suggests that Psf2 and GINS also play a role in 
meiotic DNA replication (Gomez et al, 2005). 

The assembly of the correct preRC architecture is important for regulating the initiation of 
DNA replication and the ultimate function of this complex is to recruit and position the 
replicative DNA polymerases to the Ori where the chromosomal DNA will be unwound by 
the Mem complex and bidirectional DNA synthesis will initiate. Poll, the catalytic subunit 
of DNA polymerase alpha-primase is essential for meiotic DNA replication and likely serves 
the purpose of synthesizing the lagging strand as it does in proliferating cells (Budd et al., 
1989). Not unexpectedly, subunits of the DNA primase, Pril and Pri2, are also required for 
the synthesis of the lagging strand (Longhese et al., 1993). The leading strand DNA 
polymerase Pol3 (DNA polymerase delta) plays a role in meiotic DNA replication and 
meiotic homologous DNA recombination (Schild & Byers, 1978; Maloisel et al., 2008). Pol2 
(DNA polymerase epsilon) is also found at the replication fork and has a role as leading 
strand DNA polymerase in proliferating cells (Calzada et al., 2005). Surprisingly the 
catalytic activity of Pol2 is not essential in mitotically proliferating cells likely owing to 
redundancy with Pol3 activity (Kesti et al., 1999; Ohya et al., 2002). Both of the leading 
strand polymerases are expected to be required for premeiotic DNA replication but this has 
not been formally demonstrated. 



Meiotic DNA Replication 


213 


Thus, while preRCs are composed of primarily the same components during the initiation of 
S-phase in meiotic and proliferating cells, it should be noted that the composition of meiotic 
preRC has not been formally investigated and may have components remaining to be 
identified that impart meiosis-specific regulation to the activation of premeiotic DNA 
synthesis. 

5. Activating premeiotic DNA synthesis: the role of Cyclin-Cdk and Dbf4- 
Cdc7 

In proliferating cells the activation of DNA synthesis is dependent upon B-type cyclin-Cdkl 
and Dbf4-Cdc7. S. cerevisiae has a single cell cycle regulating Cdk, referred to as Cdkl. Cdkl 
is essential for all the major cell cycle transitions in S. cerevisiae (Nasmyth, 1993). Cdkl can 
bind to and is activated by nine different cyclins. CLN1, CLN2, and CLN3 are expressed in 
proliferating G1 phase cells. Six B-type cyclins CLB1 - CLB6 are expressed in S-phase, G2 
and mitosis (Lew & Reed, 1992). Cyclin binding to Cdkl serves the purposes of activating 
the catalytic function of the kinase through promoting a change in the conformation of Cdkl 
and they direct the Cdkl kinase activity against specific substrates (Morgan, 1995; Loog & 
Morgan, 2005). Cdkl is essential for activating S-phase in proliferating cells and during 
meiotic development (Reed, 1980; Benjamin et ah, 2003). Cdkl in complex with the cyclins 
Clb5 and Clb6 is the primary Cdkl complex that drives the initiation of DNA replication. 
Clb5 is the dominant cyclin of the Clb5, Clb6 pair as it is more stable and accumulates to 
greater abundance (Jackson et al., 2006). Although other B-type cyclins can activate Cdkl to 
promote S-phase in proliferating cells none do so as efficiently as Clb5 (Donaldson et al, 
1998; Cross et al., 1999; Hu et al., 2008). This is in part due to the consideration that Clb5 and 
Clb6 accumulate in late G1 whereas the other B-type cyclins Clbsl - 4, normally accumulate 
later in G2 and M-phase (Fitch et al., 1992; Richardson et al., 1992; Schwob & Nasmyth, 
1993). However, even when other cyclins are ectopically expressed in G1 they can induce 
DNA replication but with a significant delay relative Clb5 (Donaldson, 2000; Hu & Aparicio, 
2005). It is likely that this is due to substrate specificity. Clb5 and Clb6 have an amino acid 
motif (MRAIL) that has affinity for the substrate sequence RxL that is present in proteins 
like Sld2 and Sld3 that are high affinity substrates for Clb5 (Cross & Jacobson, 2000; Ubersax 
et al., 2003; Loog & Morgan, 2005). Hence Clb5 and Clb6 are referred to as S-phase cyclins. 
The key and possibly only essential role for Cdkl in activating DNA replication is to 
phosphorylate Sld2 and Sld3 to allow them to bind Dpbll and the preRC complex or 
stabilize their binding through creation of high affinity sites (Tanaka et al., 2007; Zegerman 
& Diffley, 2007). There may be additional roles in supporting DNA polymerase binding 
since the DNA polymerase epsilon subunit Dpb2 is also Cdk substrate (Kesti et al., 1999). 
Cdkl activity is required for several aspects of chromosome metabolism in the early stages 
of meiotic differentiation. Cdkl is strictly required for premeiotic DNA replication as 
demonstrated by the meiotic G1 arrest when mutants expressing an analog sensitive version 
of Cdkl are treated with the inhibitor 1-NM-PP1 (Benjamin et al., 2003). Cdkl is also 
required for the initiation of meiotic DNA recombination and synaptonemal complex 
formation (Henderson et al., 2006; Zhu et al., 2010). Both of these processes are dependent 
upon successful completion of premeiotic DNA replication. 

Temperature sensitive alleles of Cdkl cannot be inactivated sufficiently to block premeiotic 
DNA replication, which led to the idea that Cdkl activity was not required for progression 
through meiotic differentiation (Shuster & Byers, 1989). It is unclear why proliferating cells 



214 


DNA Replication and Related Cellular Processes 


need more Cdkl activity to initiate S-phase than do meiotically differentiating cells. One 
possibility is that in proliferating cells Cdkl is required to activate the transcription factors 
MBF (Mlul binding factor) and SBF (SCB binding factor) (deBruin et ah, 2004). These 
transcription factors induce the expression of the genes whose products function to promote 
S-phase and DNA replication CLB5, POL1, RNR1 etc. (Koch & Nasmyth, 1994). In contrast, 
many of the genes whose products are required for premeiotic S-phase are regulated by the 
meiosis-specific transcription factor Imel (Mitchell, 1994). SBF is not active in meiotic cells; 
however, MBF is active during meiotic G1 and promotes Gl-specific transcription; however, 
inactivation of MBF does not eliminate expression of the genes required for DNA replication 
and does not cause any defect in premeiotic DNA replication (Raithatha & Stuart, 2005). 
Additionally, there is evidence that a subset of MBF regulated genes are differentially 
regulated during mitotic and meiotic G1-phase (Raithatha & Stuart, 2005). 

In mitotically proliferating S. cerevisiae the S-phase cyclins CLB5 and CLB6 are not essential 
for S-phase activation, indeed cells will only arrest in G1 when all six of the B-type cyclins 
are inactivated (Schwob et al., 1994). In contrast CLB5 and CLB6 are strictly required for the 
initiation of premeiotic S-phase (Dirick et al, 1998; Stuart & Wittenberg, 1998). During the 
course of unperturbed meiotic differentiation Clb5 and Clb6 are the first cyclins to 
accumulate and these trigger DNA replication whereas the other B-type cyclin genes are 
regulated by Ndt80 and only accumulate during the middle phase of sporulation during 
pachytene (Chu et al., 1998). Even if Clbl or Clb3 is expressed early in meiosis under the 
regulation of a CLB5 promoter these cyclins fail to trigger premeiotic S-phase in a clb5 clb6 
mutant (J. DeCesare & D. Stuart unpublished observation). Suggesting that Clb5 and Clb6 
have some particular property that allows them to activate DNA synthesis during meiosis. 
Possibilities include subcellular localization, recruitment to specific substrates or substrate 
specificity. Clb5 accumulates in the nucleus both in proliferating and sporulating cells as 
expected since it effectively phosphorylates components of the preRC. The fact that any B- 
type cyclin can induce DNA replication in proliferating cells implies that all cyclin-Cdkl 
complexes can gain access to the appropriate substrates at least in proliferating cells. 
However, it is unlikely that simple subcellular localization gives Clb5 its specific ability to 
activate premeiotic DNA replication because Clb3 has been localized to the nucleus in both 
proliferating and sporulating cells so in principle it should have access to the same suite of 
substrates as Clb5-Cdkl complexes. The S-phase cyclins Clb5 and Clb6 display homology 
with other B-type cyclins in their Cdkl binding domain however the amino-terminal 
portions of each cyclin are highly diverged from each other as well as other B-type cyclins 
and it is not clear what sequences direct these cyclins to their proper nuclear location. 

Even though all cyclin-Cdkl complexes can enter the nucleus they may not all have access 
to the same substrates. The potential remains for compartmentalization within the nucleus 
or sequestration of cyclin-Cdkl complexes to locations within the nucleus where they may 
not have access to the critical substrates required to initiate premeiotic DNA replication. It is 
very likely that the architectural organization of the nucleus differs between mitosis and 
meiosis owing to the unique demands for synaptonemal complex formation, extensive 
homologous recombination, and the reductional meiosis I chromosome division. Within this 
altered nuclear landscape Clb5 and Clb6-Cdkl complexes may have a unique ability to 
interact efficiently with the substrate proteins required to assemble and fire origins of DNA 
replication. 

The molecular mechanisms that confer substrate specificity to cyclins have yet to be fully 
resolved (Miller & Cross, 2001). It is clear that Clb5 has a preference for a specific collection 
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of protein substrates that can interact with the cyclin MRAIL motif. Indeed Clb5 may harbor 
additional substrate interaction motifs that confer increased affinity to one or more meiosis- 
specific substrates. The cyclins that are shown to have specific functions in S-phase: S. 
cerevisae Clb5, and Clb6 and C. albicans Clb4, have a conserved motif of charged amino acids 
at their carboxyl-terminus that is not present in other cyclins (Figure 2). This sequence may 
have a role in the S-phase functions of these cyclins but no analysis of mutations in this 
motif have been reported. Premeiotic S-phase is influenced by proteins that are only 
expressed during meiotic differentiation (see next section) and it may well be that Clb5- 
Cdkl can specifically phosphorylate or interact with those substrates. While it is not 
surprising that there would be cyclins with specialized roles in meiotic differentiation the 
mechanism by which that specificity is conferred remain to be elucidated. 


Sc. Clb5: 
Sc. Clb6: 
Kl. Clb : 
Ca. Clb4: 


374 IHDEAFQSLCIDLVKDIASSKTHLDSLILKYKKPRYGS 
330 ESDPAFKDFISELVEDIAVPDTNLDSLRLKYKKPKHGM 
340 DNEIELKELSQILINDIAKPTTQLNALIHKYKKIGTWN 
433 YTEKQLQPLADVLLENCRHAEINHKAIFEKYKERRYRK 
+ + + L+ + KYK 


Fig. 2. A conserved cluster of charged amino acids near the carboxyl-terminus of the S. 
cerevisae S. phase cyclins Clb5 and Clb6. A similar cluster is present in the C. albicans Clb4 
(an S-phase cyclin) and a cyclin from K. lactis (r_klactIV3887) whose function has not been 
confirmed. This charged cluster is not found in other S. cerevisiae cyclins. 

In addition to cyclin-Cdkl, activation of DNA replication requires the kinase activity 
associated with the Dbf4-Cdc7 protein kinase (Masai et al., 1999). Cdc7 is the catalytic 
subunit of the protein kinase whereas Dbf4 is an activating and substrate targeting subunit 
similar to a cyclin (Jackson et al., 1993). The enzyme complex is referred to as DDK (Dbf4 
Dependent Kinase). In S. cerevisiae Dbf4 accumulates in late G1 phase and binds Cdc7, 
which is constitutively expressed. The complex is then recruited to preRCs through 
interactions with both Mem and Ore proteins (Dowell et al., 1994). The catalytic activity of 
Cdc7 is required to activate DNA replication in proliferating cells as inactivation of cdc7 
temperature sensitive mutants results in a strict Gl-S-phase arrest (Newlon & Fangman, 
1975). This phenotype is replicated by the analog sensitive alleles of Cdc7 that can be 
specifically inhibited by the small molecule PP1 [4-amino-l-tert-butyl-3-(p- 
methylyphenyl)pyrazolo [3,4-d]pyrimidine] (Wan et al., 2006). Recruitment of Dbf4 to the 
PreRC is essential for the activation of DNA replication, mutations that allow Dbf4 to bind 
Cdc7 but not the preRC cause a failure in DNA replication (Duncker et al., 2002). 

The role of Cdc7 in premeiotic DNA replication has been controversial since both cdc7 
temperature sensitive mutants and analog sensitive mutants arrest post S-phase prior to the 
onset of meiotic recombination (Schild & Byers, 1978; Buck et al., 1991; Wan et al., 2006). In 
contrast the activating subunit Dbf4 is essential for premeiotic DNA replication (Valentin et 
al., 2006). Dbf4 is a highly unstable protein that can be rapidly depleted from cells unlike 
Cdc7, which is very stable (Ferreira et al., 2000). It may be the case that like Cdk, there is a 
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lower threshold of Cdc7 kinase activity required in meiosis. Nonetheless the requirement for 
Dbf4 argues that Dbf4-Cdc7 activity is required for premeiotic DNA replication. 

The essential substrates of Cdc7 have been intensively sought after. Genetic analysis 
revealed that a mutation in Mcm5 (Mcm5-P 83 L) could bypass the need for Cdc7 or Dbf4 both 
during mitotic proliferation and for premeiotic DNA replication (Hardy et al., 1997; Matos et 
al., 2008). Interestingly DDK does not appear to phosphorylate Mcm5. Rather it seems likely 
that phosphorylation of other components of the MCM complex Mcm2, Mcm4 and Mcm6 
by DDK leads to a conformational change in the complex allowing its activation (Hoang et 
al, 2007). The P 83 L mutation in Mcm5 may result in a similar change that allows activation 
of the complex in the absence of DDK phosphorylation (Hoang et al, 2007). Currently the 
best candidates for the essential substrates of Dbf4-Cdc7 are Mcm2, Mcm4 and Mcm6. DDK 
can phosphorylate these Mem proteins both in vitro and in vivo (Lei et al., 1997; Masai et al., 
2006; Francis et al., 2009). Indeed Cdc7 preferentially phosphorylates the Mem proteins 
within the context of the PreRC and chromatin bound Mcms are much better Cdc7 
substrates than the free Mem proteins (Francis et al, 2009). The role of Mem 
phosphorylation is not entirely clear but there is evidence that it stabilizes the binding of 
Cdc45 and perhaps other components of the preRC (Zou & Stillman, 2000). Additionally, it 
has been posed that phosphorylation of the Mem complex may lead to a conformational 
change that relieves autoinhibition of the helicase activity (Sheu & Stillman, 2010). 

DDK has additional roles in meiotic chromosome metabolism. Strains that harbor cdc7 
mutations are defective in the formation of DNA double strand breaks (DSBs) that initiate 
meiotic homologous recombination (Wan et al, 2006). DDK phosphorylates several residues 
on Mer2 protein allowing it to stably associate with the chromatin and trigger double stand 
breaks (Sasanuma et al., 2008; Wan et al., 2008). Phosphorylation of Mer2 by DDK is 
dependent upon prior phosphorylation of Mer2 by Cdkl (Sasanuma et al., 2008; Wan et al., 
2008). Thus, the initiation of DSB formation and homologous recombination requires 
integration of inputs from both Cdkl and DDK kinases. 

6. Does DNA replication differ between meiosis and mitosis? 

The process of initiating DNA replication in proliferating cells and cells undergoing meiotic 
differentiation are fundamentally the same. All of the proteins required to initiate S-phase in 
mitotically proliferating cells seem to also be required for premeiotic S-phase although not 
all have been rigorously tested in meiotic cells (Simchen, 1974). Premeiotic S-phase displays 
a reduced requirement for Cdk and DDK activity but these enzymes are still required. Based 
upon genetic and molecular studies it appears that the preRC assembled in cells initiating 
premeiotic S-phase is similar to or the same as that assembled in proliferating cells. 
However, a variety of investigations have revealed significant differences between mitotic 
and premeiotic S-phase. One fundamental physiological difference between the two 
processes is that proliferating cells display a specific cell size requirement before they can 
transition from G1 to S-phase (Cook & Tyers, 2007). Premeiotic DNA replication and indeed 
the entire meiotic differentiation process in S. cerevisiae is independent of cell size control 
(Stuart, 2008). This issue is somewhat controversial as some investigators have observed cell 
size regulation of meiotic differentiation but the differences may be related to the genetic 
background of the strains used in the experiments (Calvert & Dawes, 1984; Day et al., 2004). 
Genetic studies have revealed several genes whose products are required for meiotic but not 
mitotic S-phase: SP07, SP09, MUM2, and CDC55 (Esposito & Klapholtz, 1981; Davis et al. 
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2001; Nolt et al, 2011). SP07 and CDC55 encode subunits of protein phosphatases and may 
be candidates for meiosis-specific regulators of DNA replication. MUM2 displays genetic 
interactions with both the replicative DNA polymerase POL1 and ORC2 suggesting that this 
protein may participate in activating premeiotic origins of DNA replication; however, the 
basis for its requirement has yet to be reported (Davis et al., 2001). The function of SP09 
remains to be characterized. Currently the molecular basis for the meiosis-specific 
requirement for these proteins remains unknown. 

Premeiotic S-phase is also dependent upon Ime2, a meiosis-specific kinase whose amino 
acid sequence is similar to Cdks (Foiani et al., 1996). Loss of Ime2 function causes a 
profound delay in premeiotic S-phase. Ime2 may not directly regulate the initiation of 
premeiotic DNA replication because ime2 mutants display a plethora of defects including 
reduced recombination, reduction in early and middle meiotic gene expression (Mitchell et 
al., 1990; Benjamin et al., 2003). Ime2 is implicated in the inactivation of the APC CDH1 an 
event necessary to allow accumulation of B-type cyclins (Bolte et al., 2002; Holt et al., 2007). 
Ime2 is also implicated in destabilizing Sicl although this effect may not be direct (Dirick et 
al., 1998; Sedgwick et al., 2006). 

One of the most easily observable differences between premeiotic S-phase and S-phase in 
proliferating cells is the prolonged time frame for premeiotic DNA replication. S-phase lasts 
for nearly 2 hours in meiotic cells whereas the process lasts a mere 20 minutes in 
proliferating cells (Cha et al., 2000). Since the same origins of replication are utilized in 
meiotic and mitotic S-phase the most likely explanation for the slow progress of DNA 
replication is that origins are activated with lower efficiency as observed in fission yeast 
(Heichinger et al., 2006). Alternatively, the rate of fork movement may be slower in meiotic 
cells. Experiments using metabolic labeling and DNA fiber autoradiography suggested that 
the rate of replication fork movement is the same in meiotic and mitotic cells (Johnston et al., 
1982). However, during premeiotic DNA replication the replication forks must contend with 
the assembling Spoil DSB complexes on the chromatin. Replication forks have been shown 
to pause when encountering stable protein-DNA complexes (Azvolinsky et al., 2009). It is 
possible that replication fork pausing caused by the DSB complexes slows the progression of 
premeiotic DNA replication. Indeed deletion of SPOll shortens premeiotic S-phase 
supporting this contention (Cha et al., 2000). 

7. Premeiotic DNA replication and meiotic recombination: What’s the 
connection? 

The ultimate goal of the meiotic differentiation process is to produce the haploid progeny 
required for sexual reproduction. One of the hallmarks of the meiotic process is elevated 
rates of recombination that effectively "shuffle the genome" allowing for relatively unbiased 
assortment of the parental genetic material. The process of generating haploid progeny 
yields at least two significant advantages to S. cerevisiae. First the genetic assortment that 
occurs increases the likelihood of generating at least a subset of progeny that are in 
possession of increased fitness relative to the parent. Second by passing through a haploid 
stage any deleterious recessive mutations that have accumulated in the diploid parent may 
be eliminated since a haploid harboring this allele will be inviable, a type of genetic "quality 
control' mechanism. 

Meiotic recombination has been extensively studied in S. cerevisiae owing to the ease with 
which meiotic progeny can be separated and tracked. Meiotic DNA recombination is 
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generally predicated upon the successful completion of premeiotic DNA replication. The 
DNA double strand breaks (DSBs) that initiate meiotic recombination only form after the 
DNA has been replicated. This order of events is clearly important since the formation of 
DSBs prior to DNA replication would be catastrophic for genome integrity. This dependent 
relationship was elegantly demonstrated by a study that removed origins of DNA 
replication from the left arm of chromosome III, effectively delaying the completion of the 
replication of that arm of the chromosome. While DNA replication and DNA double strand 
break formation occurred with the expected normal timing on the right arm of chromosome 
III, DNA replication and DSB formation was profoundly delayed on the left arm (Borde et 
al, 2000). 

A variety of possible mechanisms have been put forth to explain this relationship such as 
altered chromatin structure of the replicated DNA allowing DSB formation or access to the 
DSB forming complex (Berchowitz et al., 2009). Another potential explanation is that the 
DNA replication machinery may interact with and "license" or activate the protein complex 
that produces the meiotic DNA double strand breaks. 

DNA DSBs are formed by a protein complex that binds to the chromatin in meiotic Gl- 
phase. The complex is minimally composed of Spoil, Recl02, Recl04, Ski8 Mer2, Mel4, 
Recll4, Mrell, and Rad50 (Keeney, 2001). Some of the integral components of the DSB 
formation complex are substrates for the S-phase protein kinases Clb5-Cdkl and Dbf4-Cdc7. 
Mer2 is phosphorylated by Clb5-Cdkl and this modification acts as a priming 
phosphorylation to trigger phosphorylation by Dbf4-Cdc7 (Henderson et al., 2006; 
Sasanuma et al., 2008; Wan et al., 2008). These phosphorylation events are essential for 
meiotic DSB formation and recombination. A mutated Mer2 lacking Cdk phosphorylation 
sites is defective in interacting with other components of the DSB forming complex 
(Henderson et al., 2006). Could Cdkl and DDK be the connection between premeiotic DNA 
replication and meiotic recombination? 

A parsimonious model to explain the temporal regulation of DSB formation and the 
involvement of Clb5-Cdkl and Dbf4-Cdc7 could be that when Clb5-Cdkl accumulates in 
premeiotic G1 phase it phosphorylates proteins required to assembly the PreRC (Sld2 and 
Sld3) as well as the DSB initiation complex (Mer2). Dbf4 accumulating in premeiotic Gl- 
phase binds to Cdc7 and recruits the active Cdc7 to the PreRC. Upon the initiation of DNA 
replication the Mem complex and other residents of the replication forks move along the 
chromosomes and Dbf4-Cdc7 moves along with the fork. As replication forks pass and 
possibly pause at sites of DSB initiation complexes Dbf4-Cdc7 phosphorylates members of 
the complex allowing for complete assembly or activation of the complex such that DSB 
formation can ensue following the passage of the replication fork. 

This model is consistent with most published data however cdc6 mutants that fail to assemble 
PreRCs form DSBs despite the absence of any detectable DNA replication (Hochwagen et al., 
2005). Although this mutation decouples replication and recombination it is not inconsistent 
with the proposed model. The Dbf4-Cdc7 complex normally becomes tethered to the PreRC 
following loading of the Mem complex by Cdc6. In the absence of Cdc6 the Mem complex fails 
to load on to the DNA and Dbf4-Cdc7 complexes are thus not sequestered and can proceed to 
phosphorylate target proteins such as the DSB initiation complex factors hence allowing DSB 
formation. The localization and activity of Dbf4-Cdc7 has not been determined in cdc6 
mutants and so this model currently remains a matter of conjecture. Indeed it is also possible 
that the DNA replication checkpoint regulates the DSB formation complex and impedes the 
formation of DSBs until DNA replication is complete (Tonami et al., 2005). 
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8. Conclusion 

DNA replication is an integral component of the meiotic differentiation program and is 
essential to the integrity of chromosome cohesion, meiotic recombination, and chromosome 
segregation. Defects in meiotic DNA replication have consequences that range from 
infertility and death of the progeny gametes to genomic instability and aneuploidy that can 
manifest as genetic diseases, birth defects and cancer. Despite our growing understanding of 
the mechanisms that govern DNA replication in mitotically proliferating cells, the 
regulation of premeiotic DNA replication remains largely uncharacterized. Several specific 
topics hold great promise for new discovery in the future, in particular the role of cyclin- 
Cdkl in triggering premeiotic DNA synthesis, the regulation of preRC assembly in meiotic 
cells, and the relationship between DNA replication and homologous recombination. 
Premeiotic DNA replication is subject to several distinct forms of regulation involving 
meiosis-specific regulatory proteins and meiosis-specific requirements for cyclin-Cdk and 
DDK. Future investigations will likely focus on clarification of substrate specificity, 
subcellular localization, and protein-protein interactions involving cyclin-Cdkl and DDK. 
Additionally, premeiotic DNA replication requires some proteins that are meiosis-specific. 
Mum2 has no close orthologs in other organisms and as yet has no clearly defined role but 
that it is required for premeiotic DNA replication in S. cerevisiae. PreRC complexes have not 
yet been closely examined in cells undergoing meiotic differentiation. Biochemical 
characterization of these complexes through proteomic investigation may yield a wealth of 
new information about the components of meiotic replication complex and how premeiotic 
DNA replication is regulated. The biochemical relationship between the DNA replication 
machinery and the recombination machinery remains to be characterized and such 
investigations will certainly lead to an understanding of how DNA DSB formation is 
restrained until the DNA replication has been replicated. The molecular mechanisms 
governing the distinct regulation of premeiotic DNA replication remain to be elucidated but 
these will undoubtedly be geared to accomplishing the specific developmental aims of 
meiotic differentiation, namely the production of haploid gametes. 
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1. Introduction 

It is well established that the presence of the diploid chromosome set in cell nuclei of the 
majority of tissues is a characteristic of vertebrates. This peculiarity was supplemented by 
the rule of the relative constancy of DNA in nuclei with diploid chromosome set (Boivin et 
al., 1948). However, some cells contain doubled or multiplied chromosome set as compared 
to other cells of the organism or other individuals of the species. As distinct from the 
generative polyploidy in which polyploidy covers all cell types including gametes, somatic 
polyploidy extents to cells of several tissues (Brodsky, Uryvaeva, 1985). 

At present there are numerous data that the cells of some tissues of animal and plants 
appear to be polyploid. The incidence of polyploidy in the tissues of animals and plants as 
well as levels of ploidy is described in detail in quite a few books and reviews (Geitler, 1953; 
Tschermak-Woess, 1971; Nagl, 1978, 1981, 1995; Brodsky, Uryvaeva, 1985; Raikov, 1982; 
Zybina, 1986; D'Amato, 1989; Zybina, Zybina, 1996; Edgar, Orr-Weaver, 2001; Ravid et al., 
2002). A large body of data on incidence of polyploidy in animal and plant tissues enabled 
W. Nagl to make a conclusion that polyploidy is a widespread phenomenon characteristic of 
the overwhelming majority of eukaryotes (Nagl, 1978; 1981; 1995). According to Nagl, 
Vertebrata represent a specific taxon that differs from other eukaryotes in the extent of 
polyploidy among the differentiated cells. Thus, in insects and higher plants, nearly all 
normal, differentiated cells are polyploid. The majority of Mammalia and, most probably, 
other Vertebrates, are characterized by the diploid chromosome set in most cells including 
both proliferative and the tissue-specific cells. Polyploidy in Vertebrata is present in many 
cell types, i.e., hepatocytes, cardyomyocytes, megakaryocytes and some others (Brodsky, 
Uryvaeva, 1985; Hancock et al, 1993; Kudryavtsev et al, 1993; Nagl, 1995; Anatskaya et al, 
2001). It suggests that propagation of vertebrates is, probably, due to an advantage of the 
mitotic cycle that involves a machinery of accurate reproduction of diploid cells over other 
cell reproduction cycles that result in polyploidization. It cannot be ruled out that the 
biological advantage of mitosis has made it possible the wide distribution of Vertebrata. 
Nevertheless, genome multiplication characteristic of wide range of highly differentiated 
cells in different taxons suggests an undoubtedly important biological significance of 
polyploidy in the lifespan of some specific cell types. 
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Most of suggestions consider polyploidy as a factor promoting some tissue-specific 
functions of some cell types (Nagl, 1978,1985; Brodsky, Uryvaeva, 1985; Edgar, Orr-Weaver, 
2001; Lee et al., 2009). In some mammalian tissues the number of polyploid cells increases 
under conditions of the physiological stress: in cardiomyocytes in heart disease 
(Rumyantsev, 1991); in the aorta vascular smooth muscle cells of hypertensive animals 
(Owens, Schwartz, 1982; Ravid et al, 2002; Hixon, Gualberto, 2003; Nagata et al, 2005; Gui 
et al., 2007); in the smooth muscle cells of myometrium during pregnancy (Heiden, James, 
1975), in thyrocytes and cells of the adrenal gland with age (Auer et al., 1985; Ravid et al., 
2002). Besides, an increase in proportion of polyploidy cells correlates with aging and cell 
senescence (Yang et al., 2007; McCrann et al., 2008; Celton-Morizur, Desdoutes, 2010). 
Switching to polyploidy, as a rule, results in loss of capability for cell divisions that is also a 
characteristic of the differentiated cells. An increase of cell size leads to a change of cell 
metabolic characteristics. Switch to endoreduplication cycle in which cell growth is gained 
without periodic reorganization of cytoskeleton and cell contacts, concomitant to the 
entrance into mitosis allows cell to continue cell growth and proliferation without 
prominent structural transformation. Such a way of cell reproduction may be important for 
the highly specialized cell types. Therefore, it worth considering different types of cell cycle 
modification as possible ways to gain different levels of genome multiplication characteristic 
of different cell types of animals and plants. 

1.1 Abbreviations 

AER - agranular endoplasmic reticulum, GER - granular endoplasmic reticulum, NE - 
nuclear membrane; AL - annulate lamellae; BB - Barr body; GCB - gonosomal chromatin 
body 

2. General considerations on the somatic polyploidization mechanisms 

The clearest notions on ways of somatic polyploidization were developed basing on the cell 
cycle concept (Howard, Pelc, 1953; Nagl, 1978; Brodsky, Uryvaeva, 1985; Edgar, Orr- 
Weaver, 2001; Lee et al., 2009). 

2.1 Restitution (uncompleted) mitoses 

Switching off the last step of mitosis - cytotomy - may be the first step to polyploidy: 
binucleate cell is formed, each nucleus containing diploid chromosome set. In the next cell 
cycle and mitosis two metaphase plates may unite forming the common plate. If the mitosis 
comes to the end, it results in two mononucleated cells with tetraploid chromosome set. 
Such cycles may repeat and lead to formation of mono- and binucleate cells of the higher 
ploidy level: 


2c - (2c x 2) - 4c - (4c x 2) - 8c - (8c x 2) 

Such a ploidy level was proved by V.Ja. Brodsky basing on the dynamics of transition of 
mono- and binucleate cells of different ploidy using combination of cytophotometry and 3 H- 
thymidine labeling of DNA replication (Brodsky, Uryvaeva, 1985). Acytokinetic mitoses 
with subsequent polyploidizing mitosis is rather widespread way of polyploidization, for 
example, in liver, myocardium, retina pigment epithelium (Rumyantsev, 1982). Similar 
ways were described later by using time-lapse video images in the aortic vascular smooth 
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muscle cells, however, as one of different ways of polyploidy genome gaining (Nagata et al., 
2005). An essential point, in our opinion, is "mitotic origin" of polyploid cells: rise of a 
polyploid nucleus requires a diploid cell to enter mitosis. 

Block of mitosis may occur somewhat earlier - at the prophase, metaphase, and anaphase. 
The cells reaching these phases then undergo chromosome despiralization and return into 
interphase. As the chromosome do not move to the poles, the cells retain the doubled 
chromosome set and become polyploid. Such a cycle was called the restitution cycle (Nagl; 
1978; 1995) or the incompleted polyploidizing mitosis (Uryvaeva, 1979). Prophase restitution 
was described in the root tip cells of onion Alliun cepa and A. carinatum (Nagl, 1978). In case 
of prophase restitution, the cells enter prophase, then return to interphase via telophasic 
decondensation. 

Metaphase restitution was found in the onion root tip cells in which mitosis was blocked by 
colchicines; spontaneous metaphase restitution was observed in the pea endosperm (Nagl, 
1978). The restitution mitoses can be recognized by overcontraction of chromosomes, their 
enhanced stickiness and failure of ordered arrangement being due to misfunction of the 
spindle (Nagl, 1978). 

Anaphase restitution was also observed in the onion root tip cells under the colchicine 
treatment, and spontaneously - in the suspensor of Lathyrus latifolius (Nagl, 1978). In this 
case of chromosome bridges, these are probably responsible for non-disjunction of the 
daughter nuclei. In the next G1-phase a cell with a dumbbell-like nucleus or a binucleate cell 
with tightly attached nuclei is formed. Metaphase and anaphase restitution were also 
described in cancer cells (Therman et al., 1986). 

Uncompleted polyploidizing mitoses were also found in the decidual cells in the rat 
endometrium - both in the normal pregnancy and in the induced deciduomata (Zybina, 
1986). Uncompleted mitosis with delays of its middle phases was found in the vertebrate 
tissue, in particular, in megakaryocytes and cancer cells (Brodsky, Uryvaeva, 1985; Nagl, 
1995). 

Spontaneous polyploidization in human tissues often includes different types of restitution 
mitoses that seem to result from disorder in many mitotic events; these events also including 
reverse cytokinesis. Thus, the hypertensive aortic vascular smooth muscle cells (VSMC) at 
the time-lapse video showed delayed mitosis as compared to the normal one (Nagata et al., 
2005). In most cases, sister chromatids were normally segregated and the cleavage furrow 
was also normally formed, but cytokinesis progression was stopped. Thereafter, reversion of 
cytokinesis occurred, and ultimately two nuclei were reversed into a single binucleate cell. 
In some cases, the cleavage furrow was formed again, and finally the VSMC were divided 
into two daughter cells. In other, rare, cases (4 times per 100 time-lapse images), there 
occurred exit from mitosis without cell division that resulted in polyploid mononuclear cell 
formation. In these cases, the cleavage furrow was formed even before sister chromatid 
disjunction and/or segregation that resulted in chromosome separation into two parts, the 
parts being bound by a chromosome bridge. These mitoses resulted in polyploid cells with 
single irregular-shaped lobular nuclei (Nagata et al., 2005). At last, there occurred some 
mitoses with a shallow cleavage furrow and missegregation of sister chromatids; thereafter 
the furrow disappeared and mononuclear polyploid cell with round nucleus was formed. 
Immunolabelling of and y-tubulin proved that the cell retained mitotic spindle and 
centrosomes that allow mitosis progression, Aurora B was also found to be normally 
expressed, but another chromosome passenger protein Survivin was downregulated. Lack 
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of Survivin expression was, probably, the reason for disturbance of the Aurora B/Survivin 
complex formation, Aurora B kinase dysfunction and failure of chromosome movement 
leading to polyploidization. Thus, failure of expression of one regulator of mitotic 
machinery may exert a pleiotropic effect on the proceeding and result of mitosis. Such a 
pleiotropic effect is the reason for various mitotic modifications and aberrations in cancer 
cells (Therman et al., 1986; Therman, Kuhn, 1989) and for numerous cases of spontaneous 
and induced mitotic restitution (Nagl, 1978). 

Regulation of the tissue-specific restitution cycle is not quite clearly elucidated. At present, 
processes of transition to polyploidization similar to uncompleted mitoses are considered as 
overcoming of mitotic spindle assembly checkpoint (Cahill et al, 1998; Gualberto et al, 
1998). The former delays the onset of anaphase and the exit from mitosis. The mitotic 
checkpoint accomplishes its function by controlling activity of the cell cycle regulatory 
machinery composed by the Mitosis Promoting Factor (MPF), the Anaphase Promoting 
Factor, and proteasome. MPF is a protein complex that consists of a cyclin-dependent kinase 
p 34 cdc 2 / cyclin B, and associated proteins (Hixon, Gualberto, 2003). MPF induces formation 
of a mitotic spindle by phosphorylating components of the mitotic spindle including 
microtubule-based motor proteins. Cyclins B are known to be involved in the microtubule 
metabolism (Nurse, 1991; Ookata et al., 1995). Degradation of CyclinB/p34 cdc2 correlates 
with entrance into the normal ana- and telophase (Lehner, O'Farrel, 1990; Holloway et al., 
1993). The Anaphase Promoting Complex (APC) and Proteasome control metaphase- 
anaphase transition and the exit from mitosis. At the onset of anaphase, the activity of 
separins contribute to release cohesins from chromosomes, allowing for segregation of 
bivalents to opposite poles. Separin activity is blocked by securins. At the karyokinesis the 
APC-complex promotes destruction of secutins and cyclin B leading to chromosome 
segregation and cytokinesis. Therefore, mutation in p34 cdc2 in the phosphorylation sites 
blocks mitosis progression (Compton et al., 1995). 

Some data about regulation of transition to polyploidization cycles were obtained on the 
model of VSMC. The aortic VSMC of mice genetically prone to hypertension were treated 
with colcemid to estimate effect of the spindle checkpoint mechanism. While VSMC from 
normotensive animals were able to arrest the cell cycle progression in mitosis, VSMC 
isolated from hypertensive animals re-entered the cell cycle in the absence of chromosomal 
segregation and undervent polyploid rounds of DNA synthesis. While VSMC of 
normotensive rats accumulated Cyclin B (mitotic arrest) in response to a mitotic spindle 
inhibitor, unscheduled degradation of Cyclin B was observed in VSMC isolated from 
capacitance arteries of spontaneously hypertensive animals (Hixon, Gualberto, 2003). In this 
case, exit from mitosis may proceed in the absence of chromosome segregation and 
karyokinesis thereby leading to cell cycle re-entry and polyploidization. 

2.2 Endocycles 

More profound reduction of mitotic cycle, as compared to uncompleted mitosis, results in 
endocycles. This term covers all phenomena of genome multiplication that are accomplished 
without nuclear envelope disappearance (Nagl, 1978; 1981; 1995; Therman et al., 1983; 1986; 
Edgar, Orr-Weaver, 2001). The endocycles include classic and angiospermal endomitosis, and 
endoreduplication or polyteny (Geitler, 1953; Nagl, 1978; 1995; Kiknadze, Istomina, 1980; 
Zybina, Zybina, 1996; 2005). 
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2.2.1 Endomitosis 

In the course of the classic endomitosis, chromosomes undergo successive modifications 
similar to that in mitosis, but without nuclear envelope disappearance, metaphase plate 
arrangement, and anaphase movement. In this case, there occur chromosome condensation, 
sister chromatid separation that is followed by their decondensation (Geitler, 1953; Nagl, 
1978; 1995). Morphology of the endomitotic chromosomes is somewhat different from the 
mitotic ones by less sharp contours. 

Classic endomitosis was described in invertebrate tissues: in a Heteropteran Gerris lateralis 
(Geitler, 1953); in the parietal cells of testicular follicle of grasshoppers (Kiknadze, Istomina, 
1980), in the albumen gland and neurons of the snail Succinea lauta (Anisimov, 1997 a, b, c; 
Anisimov, Kirsanova, 2002; Anisimov, 2005). Recently, the latter case was studied in detail. 
In the transition from endointerphase to endoprophase and, then, to endometaphase, 
nuclear envelope was retained, autosomes did not reach the complete shortening and 
thickening like in mitosis; meantime, the spindle and chromosome movement were not 
observed. In endoanaphase there occurred splitting of chromosomes into chromatids 
without spindle formation. In the transition to endotelophase and endointerphase the split 
chromatids stayed to lie in parallel, meantime undergoing progressive decondensation, so in 
endointerphase, numerous paired chromocenters were seen in the nucleus. In DNA 
synthesis study using 3 H-thymidine, the cycle of endomitosis was revealed, and duration of 
phases of endointerphase - Gl, S, and G2 - was determined (Anisimov, 1997a). Each phase 
of endomitotic cycle was characterized by specific chromosome structure and arrangement, 
and in G1-, S-, and G2-phases, numerous chromocenters were observed, the degree of their 
condensation being characteristic of each phase of the endointerphase. 

Endomitosis was also observed in the trophoblast of placenta and hydatidiform mole (Sarto 
et al., 1982; Therman et ah, 1983,1986). In these cells there occurred pictures of endomitosis, 
the degree of chromosome condensation differing in different cells. In the ^-thymidine- 
labelled cells, paired endomitotic chromosomes were seen, their condensation was less 
pronounced than in the unlabelled cells. Therefore, at the endointerphase, a partial 
chromosome decondensation and replication occurred (Therman et al., 1986); different 
chromosome regions probably were decondensed at different intervals. 

The transcriptional activity of endomitotic nuclei appeared to be sufficiently high 
(Anisimov, 1997c). The highest 3 H-uridine labelling in snail tissues was observed in G1-, S-, 
and G2-phase, i.e., during the whole endometaphase. In endointerphase and endoprophase 
the 3 H-uridine labeling decreased up to minimum, so that in endometaphase it accounts for 
4% of the endointerphase level. In endotelophase a fast increase of the 3 H-uridine labeling 
was observed. We can conclude that transcriptional activity persists throughout the whole 
endomitotic cycle, but undergoes cyclic transformation, so that at the endometaphase it is 
minimal. Electron microscopic observation also demonstrated a certain degree of 
chromosome condensation at all endomitotic stages in cells of the albumen gland of the 
snail. The uncomplete condensation is accompanied by the radial looping of thin chromatin 
threads running from the endochromosome axis; perichromatin granules were seen at the 
loops. Such an organization of endochromosomes resembles lampbrush chromosomes 
found in diplotene oocytes in many vertebrates (Callan, 1986). The extent of looping also 
seems to undergo the condensation cycle: maximal number of loops was observed at 
endointerphase, minimal - at endometaphase (Anisimov, 1997b, c). Thus, classic 
endomitosis represents a variation of the cell cycle that retains cycle of replication. 
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chromosome condensation, and transcriptional activity; however, this cycle is "shortened", 
some events corresponding to mitosis are omitted. 

It seems to be of interest that throughout the endomitotic cycle not only the nuclear 
envelope, but also nucleolus persists; the level of transcription activity of nucleolar 
organizer varies at different phases of the endomitotic cycle (Anisimova, Anisimov, 2002). 
The nucleolar activity also varies depending on the functional state of the cell, in particular, 
of the differentiation rate and direction (Anisimova, Anisimov, 2002). 


Endomitosis 


Polyteny 




= • 



• DNA replication 
. transcription 


Fig. 1. Scheme of endocycles in endomitosis and polyteny. In the endomitotic cycle 
replication, chromosome condensation/decondensation and transcription are retained; 
replication is accomplished at the S-phase; transcription is active at the G1-, S, and G2- 
phases. In endometaphase (endo-mph) and endoanaphase (endo-aph) endochromosomes 
condense and transcription declines up to minimum; sister chromatid segregation takes 
place at the endoanaphase. Arrow indicates transformations of one pair of sister 
endochromosomes in the course of the endomitotic cycle. In polyteny chromosome 
replication is accomplished in S-phase; cyclic changes of chromosome condensation and 
transcription take place; chromatid segregation is not observed. 

Apart from the classical or "insect-type" endomitosis, the so-called "angiospermal 
endomitosis" was observed in cells of many higher plants (Tschermak-Woess, 1971; Nagl, 
1978). In this case the cyclic chromosome condensation appears to be much lower 
(rudimentary) than in the insect-type endomitosis. Therefore, it is rather difficult to detect 
different endomitotic phases as compared to the classic endomitosis. 

It is to be specified that endomitotic pictures do not always reflect genome multiplication 
via endocycle progression. In cells of the testicular follicle of grasshoppers, all endomitotic 
stages were detected morphologically. Nevertheless, DNA synthesis in these cells was not 
detected at the stages when the phases of endomitosis can be seen (Kiknadze, Istomina, 
1980). By contrast, these cells show intensive transcription by the 3 H-uridine labeling. Thus, 
in such a case, endomitotic chromosome appears to be a specific form of the polyploid 
nucleus with high transcription activity without genome multiplication (Kiknadze, 
Istomina, 1980). It can be explained, for example, by the specific differentiation of the 
testicular follicle cells, in which they lose the DNA replication cycle, but accomplish their 
tissue-specific function that implies high protein synthesis. 
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Interestingly, in this case, endometaphase chromosomes resemble the organization of 
lampbrush chromosomes. 3 H-uridine label was concentrated over the loops projected from 
the central part of endochromosomes (Kiknadze, Istomina, 1980). Such a peculiarity makes 
endomitotic chromosomes of grasshopper similar to other examples of the classic 
endomitosis. 

To explain the difference between these two "endomitoses" - replicative and non-replicative 
one, it is of interest to mention that in the snail protein gland, DNA replication was not 
detected at the terminal stage of differentiation - despite the presence of typical endomitotic 
figures (Anisimov, 1997a). However, the level of 3 H-uridine labeling was decreased 
significantly as compared to the earlier, "replicative" stages in the lifespan of the cells. 

To sum up, endomitosis represents a specific type of cell cycle, in which some degree of 
chromosome condensation and transcription is maintained throughout the whole endocycle 
(Fig.l). Therefore, endomitosis may be considered as a specific form of transcriptionally 
active nucleus (Brodsky, Uryvaeva, 1985). It is strengthened by the similarity of the 
endomitotic chromosomes with lampbrush chromosomes characteristic of the diplotene of 
the I meiotic division (Callan, 1986). Lampbrush organization and uncomplete condensation 
suggest that endomitosis is a way to gain highly specified cell types with high 
transcriptional activity of tissue-specific chromosome loci; this transcriptional activity may 
be important for the special function of the cells. 

The classic endomitosis may be of doubtless interest due to its occurrence in malignant cells, 
first of all, in cervical cancer and complete hydatidiform moles (Therman et al., 1982; 1983; 
1986). 

2.2.2 Polyteny 

The most shortened cycle that consists of merely two phases - presynthetic G and synthetic 
S - is endoreduplication that results in polyteny and G2-block (Pearson, 1974; Nagl, 1978,1995; 
Brodsky, Uryvaeva, 1985; Zybina, 1986; Zybina, Zybina, 1996, 2005; Brodsky, Uryvaeva, 
1985; Edgar Orr-Weaver, 2001). 

According to V. Brodsky and I. Uryvaeva (1985), in G2-block and polyteny the cell cycle is 
devoid of the whole mitosis; therefore, the pre- and postmitotic phases are united. In this 
case the sister chromatids do not separate, each chromatid beginning a new round of 
replication in the new S-phase. As a result of a series of successive endoreduplication cycles, 
polytene chromosomes are formed. 

Control of endoreduplication cycle is not elucidated completely. However, it is certainly 
associated with cessation of expression of cyclins A and B that are necessary for mitosis 
progression (Nagl, 1995; Edgar, Orr-Weaver, 2001). Nevertheless, the detailed study of 
expression of cyclins, cyclin-dependent kinases as well as other proteins regulating 
endocycle progression prove that endocycle does not imply the complete switching off the 
components of the mitotic machinery. The switching from mitotic cycle to endocycle itself 
appears to be impossible without expression of factors regulating transition through mitotic 
phases. In switching from mitosis to endocycles, cells in the Drosophila embryo 
downregulate the mitotic regulators Cdkl, Cyclin A, cyclins B and B3 as well as 
Cdc25/string (Sauer et al., 1995, Edgar, Orr-Weaver, 2001). Meantime, studies on Fzr/Cdhl, 
a regulator of APC, show that APC activity is required for switch from mitotic cycles to 
endocycles (Edgar, Orr-Weaver, 2001). A series of investigation proved that initiation of 
endocycles required expression of E cyclins (Knoblich et al., 1994; Lilly, Spradling, 1996; Su, 
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O'Farrell, 1998; Edgar, Orr-Weaver, 2001). Besides, it is oscillatory (i.e. cyclic) expression of 
cyclin E that is required for successive endocycles in Drosophila tissues, i.e.ovarian nurse 
cells, embryonic gut, and embryonic larval tissues (Lilly, Spradling, 1996; Su, O'Farrell, 1998; 
Edgar, Orr-Weaver, 2001). In the endopolyploid murine giant trophoblast cells the 
oscillation of cyclin E expression was not demonstrated; in this case, fluctuation of Cdk2 
occurs due to the periodic activity of the S-phase inhibitor pS/^P 2 (Hattori et al, 2000; Hu, 
Cross, 2010). Therefore, endocycle retains the cyclic expression of several components and 
regulators of the mitotic cell division machinery. 

It seems to be of importance that the degree of polytene structure widely varies, and three 
main types may be distinguished: (1) classic polyteny with characteristical band structure, 
for example, in the salivary glands of Diptera (Beermann, 1972; Kiknadze et al., 1976; 
Zhimulev, 1992); (2) the cell with uncomplete expression of polytene features, for example, 
in the giant trophoblast cells (Zybina, Zybina, 1996, 2005), in the suspensor of higher plants 
(Nagl, 1978, 1981), in trophocytes of insect ovaries (Dej, Spradling, 1999), and in various 
endocycles of Invertebrata (Nagl, 1978); (3) the cells, in which the polytene features are not 
revealed, and polyteny may be suggested basing on the multifold DNA replication in the 
absence of mitosis (Brodsky, Uryvaeva, 1985). 

At present it is considered that morphological and functional features of the classical 
polytene chromosomes are accounted for by the following factors. First, the homologous 
chromosomes are bound due to the somatic conjugation. Therefore, the cells with polytene 
chromosomes show the haploid chromosome number. Second, in the course of successive 
endocycles the daughter chromatids stay bound to the initial ones; as a result, accumulation 
of the bound chromatids leads to thickening of the chromosomes. Third, the chromosomes 
are characterized by the weakening of the cycle of spiralization-despiralization characteristic 
of mitotic cycle; the degree of spiralization corresponds to prophase or interphase. It is 
probably the reason of their significant length that exceeds many times the length of mitotic 
chromosomes. The tightness of chromatid attachment in the polytene chromosome, most 
probably, also correlates with the degree of their despiralization and depends on their 
degree of ploidy (Ashburner, 1972; Kiknadze, 1972, Nagl, 1978). 

Non-classic polytene chromosomes are well-studied in the higher plants. They were 
discovered in the highly polyploid nuclei of the provisory cells important to embryo 
development: antipods, synergids, cells of suspensor and angiospermal haustorium (Geitler, 
1953; Tschermak-Woess, 1971; Hasitschka, 1956, 1957; Hasitschka-Jenschke, 1957, 1959; 
Nagl, 1969,1978). Somatic conjugation probably is not obligatory for these chromosomes. As 
a result, nuclei of suspensor of several plants contain diploid number of chromosomes, in 
rare cases - the haploid one (Geitler, 1953; Nagl, 1978; Tschermak-Woess, 1971, 1973). In 
general, polytene chromosomes are composed of loosely attached cable-like structures, their 
ends being split more or less (Nagl, 1981). Their disc pattern is less regular than in classical 
polytene chromosomes because of not very tight attachment of chromatids. In many cases, 
chromatid attachment is not observed at the full length of the chromosome. 

The degree of chromatid attachment may widely vary at different lifespan steps and 
depends on many factors including physiological state of the cell. In particular, it depends 
on the stage of polytene chromosome formation, i.e., the number of replication rounds. At 
the initial endocycles, chromatid attachment is restricted by the pericentromeric 
heterochromatin blocks; in the euchromatic regions, chromatids are separated to a 
significant distance. Each subsequent endocycle makes polytenic structure more clear-cut, 
and the "well-developed" polytene chromosomes can be identified precisely, although their 



Cell Cycle Modification in Trophoblast Cell Populations in the Course of Placenta Formation 


235 


separate loci are less distinguishable than in the classical polytene chromosomes (Nagl, 
1978). 

Besides, the degree of condensation depends on the stages of polytene nucleus cycle. In the 
highly endopolyploid cells of suspensor at the stage of replication chromonemes decondense 
and disjoin at a significance distance (Avanzi et al., 1970; Brady, Clutter, 1974). Disjunction of 
chromatids and their decondensation is observed in euchromatin and p-heterochromatin 
regions, whereas a-heterochromatin region stays condensed and retains chromatid attachment 
throughout the whole cycle of polytene nucleus (Brady, Clutter, 1974). 

Transition to the endoreduplication (polytene) cycle that may be necessary in some specific 
cases. Thus, as pointed out by B. Edgar and T. Orr-Weaver, the endocycles, once initiated 
further mitotic divisions are "ill advised for mechanical reasons". Indeed, polyteny is present 
in the highly differentiated cells. Besides, polyteny allows cells to increase their mass and 
metabolic output. Another possible advantage of endoreduplication is that it allows combining 
fast growth of the tissue with its functioning. In this connection, a very important 
consideration is that transition to endocycles allows growth without periodic rearrangement of 
cytoskeletal elements or cell-cell contacts, as happens in mitoses, and it is less disruptive to 
highly structured tissues than in mitotic proliferation (Edgar, Orr-Weaver, 2001). All this 
seems to be appropriate to all kinds of endocycles. In this connection, polyteny appears to be 
the most advantageous to gain the highest possible ploidy levels (Nagl, 1978, 1981). By 
contrast, the lower ploidy levels are generally achieved via uncompleted mitoses, these cycles 
being more characteristic of mammalian tissues, i.e. in hepatocytes (Brodsky, Uryvaeva, 1985; 
Kudryavtsev et al, 1993; Celton-Morizur, 2010), megakaryocytes (Ravid et al, 2002); cardiac 
myocytes (Pfitzer, 1971; Rumyantsev, 1991; Anatskaya et al., 2001). 

Meantime, it seems to be important that, in some cases genome multiplication implies 
entrance mitosis, in the other cases genome reproduction is achieved without the nuclear 
envelope disappearance. The biological significance of this difference, in our opinion, is 
demonstrated by the occurrence of different genome multiplication ways and their mutual 
transition in the trophoblast of mammalian placenta. 

3. Cell cycle modification in trophoblast cell of mammalian placenta 

The trophoblast cells of the rodent embryos represent an example that in the functionally 
different cell populations of the common origin have different ways of genome 
multiplication. The level of ploidy clearly correlates with the ability of cells to invade 
endometrium, to lyse and to phagocytose partly its tissues in the course of embryo 
implantation. Beginning from the onset of differentiation, the primary and secondary giant 
trophoblast cells undergo a series of endoreduplication cycle reaching very high ploidy level 
- 512c-1024c in rat and mouse (Zybina, Mosjan, 1967; Barlow, Sherman, 1972 ; Zybina, 
Zybina, 1996, 2005); 2048c and 16384c in the field vole (Zybina et al., 1975; Zybina et al., 
2003; Zybina et al., 2009). The cells have the non-classical polytene chromosomes and can 
combine cell replication with the invasive and phagocytic activity (Zybina, Zybina, 2005). 

3.1 Polytene nucleus cycle in the secondary giant trophoblast cells 

Two phases - endointerphase (S) and endoprophase (G) - were determined in the polytene 
cycle of the secondary giant trophoblast cells in rat placenta by using 3 H-thymidine labeling 
(Zybina, Zybina, 1996, 2005). In endoprophase non-classic polytene chromosomes in a form 
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of bundles of quite condensed chromatids of chromomeric structure are observed (Fig. 2b). 
The bundles are attached to the nuclear envelope. At the endointerphase nucleus is filled by 
paired thin Feulgen-positive threads, the "threads" being scattered throughout the 
caryoplasm and bound to the nuclear envelope (Fig. 2a). The minimal chromosome 
condensation characteristic of the interphase nucleus is observed at the endointerphase; 
however, some chromosome regions are visualized at this phase. Thus, parallel 
chromosome threads with characteristic disc pattern can be observed in the site where 
chromosomes penetrate nucleolus (Fig. 2a). Later on, three phases of endoreduplication 
were discerned - Gl, S, and G2 - based on the S-phase inhibitor pS^'P 2 (Hattori et al., 2000). 



Fig. 2. Polytene chromosomes in the rat secondary giant trophoblast cell. A. endointerphase, 
b. endoprophase. 

Two phases of polytene cycle are also observed in the mouse and rabbit giant trophoblast 
cells. In endoprophase some nuclei showed lampbrush-like structure (Fig. 3a, b): loops are 
running from the heterochromatic condenced region. At the phase contrast preparations, 
polytene chromosomes look like the long threads attached to the nuclear envelope (Fig. 3c). 
DNA replication takes place at the endointerphase, as this was demonstrated by using both 
authoradiography (Zybina, 1986) and cytophotometry (Zybina et al, 1985). The level of ploidy 
of "meshwork" nuclei (Fig.2a) was 16c-256c, i.e., somewhat lower than in nuclei with 
chromosome bundles (Zybina, Zybina, 1985;1996). In particular, it may be accounted for by 
the presence of a number of nuclei, in which polytene chromosome bundles were not yet 
formed: the clear-cut bundles are observed beginning from 64c. In the group of nuclei of 
"polytene" type (Fig. 2b) the sharp peaks corresponding to ploidy classes 64c, 128c, 256c, and 
512c are observed. By contrast, in the group of "meshwork" nuclei there are many 
intermediate values that correspond to DNA synthesis. The intermediate nuclei are also 
characterized by sharp peaks, but ploidy level is lower than in "bundles": the nuclei probably 
correspond to the process of decondensation of polytene chromosome bundles or the early 
endocycles when polytene structure is yet underdeveloped. The data confirm that 
endoreduplication cycle includes the cycle of polytene chromosome condensation, the 
maximum condensation level being in the endoprophase. 


Cell Cycle Modification in Trophoblast Cell Populations in the Course of Placenta Formation 


237 


The rate of the polytene cycle proved to be rather high, it is especially high at the 
implantation period. Using DNA cytometry, it was demonstrated that at the 5-7 day of 
gestation in mouse some of the trophoblast cells undergo 3 cycles reaching 32c and 64 c. 
Thereafter, a decrease of polyploidisation is observed with subsequent constant rate of 
polyploidization: one endoreduplication cycle per day (Barlow, Sherman, 1972; Zybina, 
1986); according to autoradiographic data, duration of S-phase in rat trophoblast cells is 
about 6 h, which is approximately equal to duration of S-phase in diploid cells. However, 
duration of G-phase upon polyplodidzation increases from 8.3 to 24.9 h for 12-15 days of 
gestation (Andreeva, 1964). Meantime, there also may be an increase of S-phase duration 
from 45-h to 6-7-h upon the endoreduplication progression (Zavarzin, 1967). 

An increase of endoreduplication cycle duration was also detected in the case of classic 
polyteny of Drosophila (Rudkin, 1972). In the Drosophila salivary glands S-phase lengthens 
upon polyploidization from 2 10 n to 2 n n by 20% (Gunderina et al, 1984). However, the 
highest ploidy levels are mainly achieved via endocycles. 



Fig. 3. Polytene chromosomes in the mouse (a, b) and rabbit (c) secondary giant trophoblast 
cells, (a) - non-classic polytene chromosome bundles; (b) - lampbrush chromosome-like 
structure: the loops (arrows) run out of the condensed part of the chromosome bundle; c - 
polytene chromosome bindles seen as parallel chromosome threads (arrow), a, b - 
acetoorcein staining, c - unstained preparation at phase contrast. 

Both endointerphase or endoprophase are characterized by a certain level of transcriptional 
activity. However, in endoprophase the RNA synthesis is decreased (Zybina, 1963). 
Therefore, like in endomitosis, transcriptional activity is maintained, but oscillates in the 
course of the whole endoreduplication cycle. The nucleolus also persists throughout the 
cycle. According to the electron microscope study, the volume of the granular component 
that reflects the amount synthesis and accumulation of preribosome particle is higher at 
endointerphase; at this phase the number of fibrillar center that reflects the number of site of 
rRNA synthesis is also higher than at endoprophase (Zybina, 1986; Zybina, Zybina, 1996). In 
the highly differentiated giant trophoblast cells that exits from the cell cycle (the 3 H- 
thymidine labeling is not seen any longer), the polytene chromosome bundles become 
indiscernible, probably, due to weakening of the sister chromatid attachment. However, in 
this case, transcriptional activity does persist, and the transcriptionally active nucleolus may 
be observed (Zybina, 1986; Zybina, Zybina, 1996), although a decrease of granular 
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component and fibrillar center amount prove the attenuation of the preribosome synthesis 
and processing. In this case, however, two types of nuclei that differ in the degree of 
chromatin condensation also are observed; they are also different in the transcriptional 
characteristics of the nucleolus (Zybina, 1986; Zybina, Zybina, 1996). Therefore, in the 
absence of DNA replication the polytene cycle is maintained, it is reflected, at least, in the 
variation of the chromosome condensation and transcription activity. 

Differentiation of the giant trophoblast cells that involves exit from the regular mitotic cycle 
and the onset of endoreduplication is under the developmental control. In the differentiation 
of giant trophoblast cells from the Rcho-1 choriocarcinoma cell line the downregulation of 
Idl and Id2 occurs, being concomitant with upregulation of the basic helix-loop-helix factor 
Hxt and acquisition of an increased adhesiveness (McAuley et al., 1998) that triggers the 
onset of invasive capability of the giant trophoblast cells. Cell cycle modification is triggered 
by switched Cyclin D isophorm expression from D3 to Dl. The initiation of the S-phase 
during endocycles appeared to involve synthesis of cyclins E and A, and termination of the 
S-phase during endocycles was associated with abrupt loss of cyclins E and A. Both cyclins 
were absent from the gap-phase during endocycles, which suggests that their degradation 
may be necessary to allow reinitiation of the endocycle. The arrest of the mitotic cycle at the 
onset of endoreduplication was associated with a failure to assemble cyclinB/p34 cdkl 
complexes during the first endocycle. In the subsequent endocycles, the cyclin B expression 
was suppressed. Therefore, at the first endocycle, its regulation retains some traits of mitotic 
cycle and then switches to the specific endoreduplicative regulation pattern. Interestingly, in 
the murine giant trophoblast cell population, expression of cyclin B1 trascripts was found at 
the transcriptional level both in diploid and in polyploid cells, whereas the B1 protein was 
detected exclusively in the diploid cells. Therefore, downregulation of Cyclin B is the key 
event that switches off the mitotic machinery. Meantime, Cyclin Dl transcripts were found 
only in giant trophoblast cells, even prior to that of placental lactogen I being early marker of 
giant trophoblast cell differentiation. This pattern of expression probably means that mural 
trophoblast differentiation and endoreduplication depend on the prolonged G1-phase. The 
authors state that D-type cyclins are required for endocycle maintenance, keeping in mind 
that antisence oligonucleotids to Cyclin D3 were found to block endoreduplication in 
murine megakaryocytes; besides, overexpression of cyclin D3 proved significant increase of 
endomitosis in the megakaryocytes (Zimmet et al., 1997). The zinc finger transcription factor 
Snail regulates the "G 2 decision point" whether the trophoblast cells go through mitosis or 
enter endocycle (Nakayama et al., 1998). 

The polytene cycle of the giant trophoblast cells involves periodicity of S-phase during 
endoreduplication. During the mitotic cell cycle, biochemical events coincident with mitosis 
lead to the re-setting of the origins of replication (Elledge, 1996). This includes degradation 
of the protein Geminin that otherwise suppresses the firing of origins of replication 
(McGarry, Kirschner, 1998). Geminin mutant embryos die during preimplantation 
development and show ectopic endoreduplication in blastomeres (Gonzales et al., 2006; Hu, 
Cross, 2010). Another important mechanism to maintain periodicity during the endocycle 
involves cyclic expression of p57 ki P 2 , a Gl/S Cdk inhibitor (Hattori et al., 2000). During 
subsequent endocycles, the p57 ki P 2 level fluctuates and this identifies two phases: the endo- 
G2 phase with the p57 ki P 2 completion of S-phase and the endo-Gl-phase with p57 ki P 2 
declining several hours before entry into S-phase (Hattori et al., 2000). It is hypothesized that 
periodic expression of p57 ki P 2 protein promotes alternating S and "gap" phases in the 
endocycle (Hu, Cross, 2010), the "gap" probably corresponding to endoprophase. 
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3.2 Transition between different modes of genome multiplication 

Junctional zone and labyrinth trophoblast in the rat and field vole placenta represents a cell 
population, in which a high proliferative activity is accompanied by a multidirectional 
differentiation. These cells do not reach a high ploidy level characteristical of the primary 
and secondary trophoblast cells, they achieve the ploidy level as high as 16c and 32c ( 
Zybina, Zybina, 2000, 2005). These cell populations are characterized by different ways of 
genome multiplication in development of one cell line at different stages of ontogenesis. The 
incidence of this phenomenon among the multicellular organisms is not studied in detail. 
Nagl (1978) pointed out that the first step of polyploidization was achieved via restitution 
mitosis. In the rat proliferative trophoblast cell populations the acytokinetic and 
uncompleted mitosis are characteristic of the first step of polyploidization. The meta- and 
anaphase restitutions also were often found in these cells (Fig. 4). 



Fig. 4. Restitution mitoses in the junctional zone of the rat placenta, a, b -regular diploid (a) 
and polyploid (b) anaphases; c - a regular metaphase and metaphase with irregular 
chromosome alignment; d, e -restitutional anaphase with chromosome bridges. 

Cytophotometrically among the mitotic figures of different ploidy (4c, 8c, and 16c), all 
mitotic stages were found - from prophase to telophase. Hence, mitosis in the junctional 
zone and labyrinth trophoblast can proceed up to octaploid level. Meantime, there was a 
noticeable predominance of earlier mitotic stages (pro- and metaphase) over the later ones 
(ana- and telophases) among the polyploid mitoses (Zybina et al., 2005). Among the 
polyploid mitotic figures, mostly restitution mitoses were present, very often anaphases 
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with multiple bridges were revealed, so the cells had no chance to segregate. Therefore, 
upon a rise of the ploidy level, the ability to undergo the complete mitosis falls down. 
Besides, it is to be emphasized that upon cessation of the mitotic activity their cycles of 
replication do not stop. The cells continue to uptake 3 H-thymidine and reach the ploidy 
level 32c, sometimes 64c and 128c via endoreduplication (Zybina et al., 2000, 2005). 
Transition from the regular mitosis to the uncompleted one and then - to endocycle was 
described in several tissues of snail (Anisimov, Kirsanova, 2002). In the course of neuron 
development of the snail Succinea lauta in the dorsal bodies and procerebrums besides the 
usual mitoses there occurred metaphases resembling k-mitoses with chromosome 
disorientation in metaphase. The chromosome arrangement showed sparseness that made 
possible to count chromosome number. In anaphase there occurred chromosome lagging 
and bridges as well as lack of segregation to the poles. All this indicates progression of 
restitution mitoses. The cycle resulted in the cells 4c, 4c/2n, and 4c/4n, where c is ploidy 
determined by DNA content, and n - ploidy determined by the chromosome number 
counting. In the course of differentiation of neurons there occurred transition to the 
endomitosis cycle that resulted in nuclei 32c-64c and later - 512c and higher (Anisimov, 
Kirsanova, 2002). The data confirm the regularity found in the trophoblast cell whose initial 
steps of polyploidization (mainly up to 8c) are achieved by uncompleted (restitutional) 
mitoses, then switch to endocycles. 

It is to be mentioned that the trophoblast cell population that starts their genome 
multiplication via uncompleted mitoses does not prove invasive properties. As soon as they 
move to the border with maternal tissues (decidua basalis) or penetrated endometrium, they 
lose their capability for mitosis and DNA replication and reside inside the semiallogenic 
tissue in a form of a polyploid cells (Zybina, Zybina, 2005). Arrest of mitosis and complete 
repression of DNA replication after a series of endoreduplication cycles makes hardly 
probable the renewal of mitotic activity in the deeply invading tertiary giant trophoblast 
cells, thereby preventing the possibility of their ectopic expanding in the maternal tissues 
during the normal pregnancy. 

3.3 Genome multiplication in the trophoblast cells of carnivore and ruminants 

Trophoblast in the hemochorial placenta of silver fox and mink is also highly invasive. 
Among the trophoblast cells of fox and mink, polyploid cells prevail, the level of ploidy 
being mainly 2c-64c, the cells with highly polyploid nuclei 128c and 256c being rare (Zybina 
et al., 1992, 2001). 

The common ways of polyploidization appear to be the same as in the rodent placenta. The 
nuclei reach 4c-8c via restitution mitoses, whereas the higher ploidy levels are achieved via 
endoreduplication (Zybina et al, 2001, Zybina, Zybina, 2005). 

Ruminants do not have highly invasive trophoblast cells: their placenta is epitheliochorial in 
which trophoblast cells mostly attach the uterine epithelium and do not penetrate uterine 
wall. Among the trophoblast cells there is a portion of polyploid cells mostly up to 8c 
(Klisch et al., 1999). The examples considered in this chapter confirm the regularities 
obtained in the rodent placenta. The invasive trophoblast cells in Carnivora that come into a 
close contact with the allogenic tissue reach high ploidy levels via endocycles, whereas the 
low-invasive trophoblast cells in ruminants reach 4c-8c and some higher via acytokinetic 
and other uncomplete mitoses. 
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3.4 Genome multiplication of the extravillous trophoblast cells in the human placenta 
alongside the invasive pathway 

In the human placenta, polyploidization also takes place, though the cells do not reach as 
high ploidy level as other Mammalia. Meantime, like in other mammals, the highest ploidy 
level are characteristic of the invasive trophoblast cells. 

In human placenta, the differentiation of the trophoblast cells that invade endometrium 
during pregnancy is accomplished in the cell columns at the tips of encoring villi 
(Kaufmann, Castellucci, 1997). The invasive pathway includes several steps (Zybina et al., 
2002, Fig. 5a): (a) stem cells of cell column (CC) attached to the basal membrane at the apical 
part of the ancoring villi; they have high mitotic activity and express integrins 
(Kaufmann, Castellucci, 1997); (b) a proximal part of CC composed of a few compact layers 
of proliferative trophoblast cells, in which mitotic figures are present; (c) a distal part of CC 
composed of more loosely arranged EVT cells that contain larger nuclei and more abundant 
cytoplasm; they express integrins asPi, a v p 3 , and a v Pi (Kaufmann, Castellucci, 1997); (d) EVT 
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Fig. 5. Invasion of the extravillous trophoblast into the uterine wall in human placenta and 
its polyploidization. A - the trophoblast cells are proliferative in the zone of stem cells and 
proximal part (pp) of cell column, in the distal part (dp) they acquire invasive phenotype 
and migrate into junctional zone (jz), i.e. decidualized endometrium, a proportion of 
trophoblast cells reach myometrium (myo); B - polyploidization of the human extravillous 
trophoblast cells in the course of invasive pathway. 
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of the distal CC part, most closely attached to the basal plate; (e) invasive EVT cells 
determined by the cytokeratin-7 positivity within the materno-fetal junctional zone; (f) 
invasive, cytokeratin-7-positive EVT cells within myometrium. 

The stem cells were found to be mainly diploid; a small amount of tetraploid cells (13.8 ± 
2.3%) could be attributed to G 2 -phase. The percentage of polyploid cells rose upon cell 
differentiation, so that in the distal part of CC the majority of EVT cells became polyploid, 
the portion of tetraploid cells (58 ± 1.5%) exceeded that of diploid ones (38 ± 1.5%); some 
amount of octaploid nuclei appeared. In the group of EVT attached to the basal plate, but 
not invading it, the percentage of tetraploid cells was the highest (74.7 ± 1.7%), and the 
octaploid cell group rose to 4.9 ± 0.8%. The EVT cells invading the decidualized 
endometrium also were predominantly polyploid: the fraction of octaploid nuclei increased 
to 9.7 ± 1.0% and some amount of 16c nuclei appeared (1.4 ± 1.0%), while the percentage of 
diploid cells rose slightly. This tendency did also take place in the group of invasive EVT 
cells within myometrium, although polyploid cells prevailed in the majority of the placentas 
studied (Fig. 5b). The portion of cells of different ploidy levels changed statistically 
significantly at every stage of differentiation of the EVT and invasive pathway (P < 0.01). 
Besides, it is reasonable to note the individual variability of ploidy of invasive trophoblast 
cells, the higher ploidy level being established in the pregnancies with the greater amount of 
the extravillous trophoblast (Zybina et al., 2002). In the latter case, a higher level of pKi-67 
expression was detected, which indicates their higher proliferative activity (data not 
shown). 

Thus, in human placenta, like in other mammals, the trophoblast cells undergo 
polyploidization in the course of acquirement of invasive properties. However, the ploidy 
level is not as high as in rodent placenta. 

As to mechanism of polyploidization, the uncompleted mitoses most probably do not play a 
significant role, despite the low ploidy level. The DNA content in the mitotic figures was 4c, 
i.e., the diploid cells, most probably, are only able to divide mitotically in the cell columns. 
Simultaneously, in both the villous and, partly, the extravillous trophoblast cell populations 
of the human placenta, there occurs a high amount of cells containing a great number of 
chromocenters, some of which often resemble endomitotic chromosomes (Zybina et al., 
2004). Rather numerous small nucleoli were observed to be attached to the chromocenters. 
The chromocenters/endochromosomes were found in the stem cells and in the proximal 
part of CC. Not infrequently, some large nuclei contained chromocenters that were 
markedly larger as compared with the neighbor cells. In the distal part of CC as well as in 
the EVT invaded endometrium and myometrium nuclei with a relatively uniform, network¬ 
like chromatin distribution were observed. The number of chromocenters in the EVT in the 
Feulgen and DAPI-stained sections was 15-30, the mode being 20-24 per nucleus, which is 
comparable to the haploid chromosome number in the human genome. The number of 
chromocenters / endochromosomes did not increase significantly with rise of the ploidy 
level. These findings indicate that polyplodization do not involve chromatid disjunction; 
moreover, the haploid number of endochromosomes suggests somatic conjugation 
characteristic of polyteny. Therefore, it is reasonable to suggest the first step of 
endoreduplication / polytenization in the extravillous trophoblast cells. 

The intracellular localization of Ki-67 in EVT also provides some data on the ways of 
genome multiplication. In the stem cells and cells of the proximal part of CC, the Ki-67 
immunostaining reminded the chromocenters revealed in Feulgen-, DAPI-, and 
hematoxylin-stained preparations. Besides, 2-5 small MIBl-positive nucleoli were often 
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revealed in the cells of this CC part. By contrast, in the EVT of the distal part of CC the pKI- 
67 was located mainly in 1-2 large nucleoli, rather than in chromocenters. Such difference in 
the intranuclear distribution of Ki-67 indicates changes in the cell cycle, when taking into 
account that different patterns of the Ki-67 localization characterize different stages of the 
cell cycle (Starborg et al., 1996; Bridger et ah, 1998). Thus, in the early Gl, pKi-67 is detected 
in numerous foci of centromeric and telomeric DNA localization scattered throughout the 
nucleus (Bridger et al., 1998). In the late Gl, S, and G2-phase, pKi-67 is found inside the 
well-formed nucleolus (Verheijen et al., 1989; Kill, 1996), in mitosis this protein is associated 
with chromosomes (Bridger et al., 1998; Suurmejer, Boon, 1999). Transition to the 
predominant nucleolar localization of Ki-67 suggests that redistribution of the protein 
characteristic of the mitotic cycle does not take place. In combination with cycle progression 
indicated by Ki-67 immunopositivity and lack of mitoses, it is also a proof for transition to 
the shortened cell cycle - endoreduplication. However, the relatively low ploidy level 
suggests progression of a few endocycles, afterward the cells leave the cell cycle, and the 
invasive EVT loose their capability for replication. 

Populatios of the non-invasive - villous - trophoblast also contain a number of octaploid 
cells (Wakuda, Yosida, 1992; Potgens et al., 2001), although their frequency is not as high as 
in cells of invasive pathway. 

Giant trophoblast cells undergoing endomitosis and non-classic polyteny were found in 
some cases of normal human pregnancy (at the early stages of embryonic development) as 
well as in some pathological cases. The giant cells with bundles of parallel chromatin 
threads are more specific for trophoblast malignant tumors (Sarto et al., 1982; Therman et 
al., 1983; Kuhn, Therman, 1988). Endomitotic chromosomes were revealed in the human 
hydatidiform moles (Therman et al.,1983). These chromosomes sometimes weree not 
detached from each other in the course of consecutive replication cycles and formed clusters 
of adjacent endochromosomes. Therefore, endocycles (endomitosis and endoreduplication) 
appear to be characteristic of human trophoblast cells. Meantime, the well-developed non¬ 
classic polyteny in the human trophoblast appears to be not numerous, as the cells, most 
probably, do not undergo many cycles of endopolyploidization. 

Regulation of cell polyploidization of the human EVT is not understood completely, 
although some data suggest similarity and specificity as compared to the rodent 
trophoblast. Thus, "mitotic" cyclins A and B1 as well as Ki-67 and PCNA were 
immunolocalized in proximal and distal EVT as well as in the EVT that invaded the upper 
decidual segments (Korgun et al., 2006). Cell cycle inhibitors p27 and p57 were expressed in 
all extravillous trophoblast cells (EVT), but p21 was not. The authors state that the set of cell 
cycle regulators present in the cell column strongly suggests that cells in both proximal and 
distal part of CC do not acutely proliferate. However, they maintain this ability because they 
have not left cell cycle, i.e. they express Ki-67 and PCNA as well as mitotic cyclins. We are 
of opinion that p27 and p57 expression reflect attenuation of proliferative activity in the EVT 
trophoblast along the invasive pathway. Meantime, a strong expression of p57 may be a 
feature of the endoreduplication cycle, because p57, being the S-phase inhibitor, proves 
oscillatory expression in the course of mouse giant trophoblast cell endoreduplication 
(Hattori et al., 2000; Hu, Cross, 2010). Mitotic cyclin B1 (as judged from the photo, Korgun et 
al., 2006) also shows downregulation in the invasive pathway. The immunolabelled cells 
may be found in the CC, but not inside the decidualized endometrium. However, 
endomitotic polyploidization may involve cyclin Bl/cdk2 activity, as this way of cell cycle 
involves chromatid segregation, in which p34/cdk2 may take part. 
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Cyclin D1 also was expressed in the human cytotrophoblast compartment, which suggests 
its cell cycle processes up to the third trimester of pregnancy (De Falco et al., 2004). In fact, 
Cyclin D expression may be associated with polyploidization.lt is confirmed by the data 
obtained on megakaryocytes. Thus, Cyclin D3 upregulation in response to thrombopoetin in 
the megacaryocyte lineage and Cyclin D3 overexpression were found to increase the 
megacaryocyte ploidy (Zimmet et al., 1997). A decrease in the level of Cyclin D3 blocks 
polyploidization (Wang et al., 1995). In Drosophila, Cyclin/Cdk4 also increases ploidy in 
endocycling tissues (Datar et al., 2000); therefore, D cyclins appear to be critical for 
endocycle progression. 

Cyclin E may be also involved in transition to endocycles in human extravillous trophoblast 
endocycle. Thus, Cyclin E was found to express in the EVT, the highest level was found in 
the CC. In the EVT of CC, expression of Cyclin E started more distal as compared to Ki-67 
and was maintained (with less intensity) in the deeper layer of interstitial trophoblast 
(Bamberger et al., 2003). According to flow cytometry, there is a correlation between p27 and 
Cyclin E expression in the human cytotrophoblast of normal placenta and hydatidiform 
moles (Fukunaga, 2004). The role of Cyclin E is supported by the data of the knockout 
experiment on murine embryos (Parisi et al., 2003). Double knockout mice with targeted 
genes encoding Cyclins El and E2 showed a marked reduction of ploidy of the trophoblast 
cells. It suggest a specific role of Cyclin E in trophoblast cell endocycle entry, probably in the 
absence of mitoses. 

Anyway, human trophoblast polyploidization needs further investigation. 

4. Depolyploidization and genome segregation is a terminal step of the 
polytene nucleus cycle 

4.1 Transition from the polytene nucleus to the polygenomic one 

The lifespan of the giant trophoblast cells in most of rodent placentas finishes by 
disintegration of the polytene nucleus into low-polyploid nuclear fragments, so that the 
highly polyploid mononuclear cell becomes a polykaryocyte (Zybina, 1986, 1990; Zybina, 
Zybina, 1996; Zybina et al., 2005). It is to be emphasized that this process is accomplished at 
the stage of the complete cessation of DNA replication and attenuation of RNA synthesis. It 
is not quite clear how the redistribution of chromosomes is achieved. However, in some 
rodent giant trophoblast cells there occurs disintegration of the polytene chromosome 
bundles into numerous paired endochromosomes. It is the most strongly pronounced in the 
highly invasive supergiant trophoblast cells in the field vole placenta, which reach very high 
ploidy level 256c-16384c (Zybina et al., 2009). The degree of chromatid attachment in these 
cells is weakening, they detach one-by-one from the surface of the chromosome bundle. As a 
result, the supergiant nucleus appears to be filled with rather decondensed 
endochromosomes; at the later stage the endochromosomes undergo condensation and look 
like paired endomitotic chromosomes (Fig. 6). The stage may be considered as a maximal 
level of chromosome condensation in the polytene nucleus cycle; it cannot be ruled out that 
it is one of the stages of chromosome rearrangement before depolyploidization. 

Polytene chromosome disintegration into endochromosomes was also described in the 
endocycle of Dipteran cell types, in which classic polyteny occurs at one of the stages of the 
cell lifespan. Thus, in the nurse cells of ovaries of Calliphora erythrocephala polytene 
chromosomes with the clear-cut disc pattern were seen in the early endocycles. At the later 
stages, loosening of the disc pattern occurred with simultaneous shortening of chromosomes 
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due to the spiralization of chromatid and their detachment from each other. As a result, the 
polytene chromosome broke down into endomitotic chromosomes (Wasserlauf et al., 2003; 
Ananina, Vedernikov, 2003, 2005). DNA hybridization in situ with chromosome 3 probe 
showed that endochromosomes did not spread throughout the nucleus, but form a 
chromosome territory formed by a multitude of homologous endochromosomes. This 
process seems to be analogous to the depolytenization in the giant trophoblast cells. In this 
case the chromosome structure and the activity cycling also take place. Thus, the nucleolar 
organizer activity varies during the polytene cycle and cell onthogenesis. In the course of 
polytene chromosome compactization, nucleolus falls into numerous small nucleoli, then 
they progressively decrease in number. At the stage of endomitotic chromosomes the 
nucleoli disappear completely (Wasserlauf et al., 2003). The stage functionally corresponds 
to mitosis. Thus, disaggregation of polytene chromosomes into endomitotic chromosomes 
occurs at the stage of attenuation of transcription. 



Fig. 6. Supergiant trophoblast cells of Microtus rossiaemeridionalis. a - a supergiant nucleus 
with bundles of non-classic polytene chromosomes; b - desaggregation of polytene 
chromosomes into endochromosomes. 

Staining: a - Feulgen reaction, b - bromphenole blue. 

4.2 Fragmentation of the giant trophoblast cells 

Genome multiplication in the differentiation of a wide range of tissues in most cases 
proceeds irreversibly (Brodsky, Uryvaeva, 1985). One property intrinsic to many endocycles 
is that once they are initiated, further mitotic divisions are ill-advised for mechanical reasons 
(Edgar, Orr-Weaver, 2001). The mechanism of this phenomenon has not been yet clearly 
deciphered. These statements are proved, in particular by the examples that illustrate loss of 
the key components of the mitotic machinery in the cells undergoing endocycles. Thus, in 
the ovarian trophocytes of Drosophila (Mahowald et al., 1979), on reaching 8c when mitotic 
divisions decline, the centrioles migrated into oocyte through the intercellular bridges and 
finally lied adjacent to the oocyte nucleus. Thus, disconnection of chromosome, centrosome, 
and centriolar cycle and disaggregation of the centriolar complex may be a way of 
commitment to switching to the irreversible polytene cycle. 

The exception is the mammalian trophoblast cell that undergo depolyploidization on 
reaching high levels of ploidy via a series of endoreduplication rounds (E. Zybina et al.. 
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1979,1986). It should be emphasized that fragmentation of the highly endopolyploid nuclei 
begins after cessation of cycles of DNA replication (Zybina, 1986; Zybina, Zybina, 1996), i.e., 
at the second half of pregnancy. The non-classic polytene chromosomes are not revealed at 
this period any longer. 

In the course of fragmentation, deep folds of nuclear endvelope (NE) dissect an initial 
nucleus into nuclei of smaller size (Fig. 7a). In many cases, NE emerges projections at one 
pole of the nucleus. Later on, small round nuclear fragments are detached from one side of 
the initial giant nucleus (Fig. 7b). In other cases, the whole nucleus turns out to be 
segregated into several dozens of small nuclei tightly attached to each other. Sometimes, the 
both processes take place in the same nucleus, i.e in case of binucleate giant cell; otherwise, 
initial nucleus first fall down into two giant nuclei, then they undergo fragmentation. 
Distribution of chromosomes, most probably, proved to be genome-wise. The DNA 
cytometry demonstrated that the ploidy level of nuclear fragments corresponded to 
predominance of lc, 2c, 4c, 8c (Zybina et al., 1975; Zybina, 1990; Zybina et al., 2005). The 
natural chromosome markers - nucleoli, Barr bodies (inactivated X-chromosomes) and 
condensed heterochromatin regions of polytene chromosomes - are distributed into nuclear 
fragments in accordance with ploidy their level (Zybina, 1986; Zybina, Zybina, 1996). The 
whole-genome chromosome distribution into the nuclear fragment is confirmed by the use 
of another natural chromosome marker of the interphase nucleus, i.e., gonosomal chromatin 
bodies (GCB, Zybina et al., 2003, 2005). Cytophotometrical measurement of DNA content in 
the nuclei, nuclear fragments, and simultaneously in the gonosomal chromatin bodies was 
made in the SGTCs of field vole M. rossiaemeridionalis. In most cases, 1-2 GCBs were found in 
the nuclear fragments of different ploidy levels. In the nuclear fragments GCB, the DNA 
content decreased mostly proportionally to the DNA content in the whole fragments 
corresponding to 2c, 4c and 8c. 



Fig. 7. Fragmentation of the secondary giant trophoblast cells in rat placenta, a - at the 
beginning of fragmentation the giant nucleus breaks down into two parts that undergo 
further fragmentation; b - the small nuclear fragments are budding off one side of the initial 
giant nucleus. Staining: Heidengein hematoxylin. 

At the light microscope level it may be observed in rat SGTC that deep folds of the nuclear 
envelope (NE) dissect the nucleus, these folds separating single nuclei of smaller size (Fig. 
7). In many cases the projections lie only at one pole of the nucleus (Fig. 7); in the same cases 
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their localization coincides with the nuclear zone, in which fragmentation begins and 
fragments start to detach from the initial nucleus (Fig. 7). 

Observation of the ultrastructure of the initial giant nuclei during their subdivision into 
fragments proved NE and its derivatives to play an active role in this process (Zybina, 
Zybina, 2008). The very deep folds divide the nucleus into several lobes. Apart from NE, a 
vide range of other NE derivatives may provide a reserve material for the nuclear 
fragmentation: annulate lamellae (AL), membranous tubules packs of pore complexes, etc. 
AL represent outgrowth of the inner or outer membrane of nuclear envelope consisting of 
parallel membranes joined by numerous pore complexes (Kessel, 1992; Zybina, Zybina, 
2008). The intranuclear ALs of SGTCs are observed mainly at the periphery of the nucleus, 
i.e., in the zone of the most extensive NE folding. The areas covered by the double smooth¬ 
surfaced membranes were also found in the superficial part of the karyoplasm in the zone of 
fragmentation. 

It is worth mentioning that a significant portion of SGTC undergoing fragmentation does 
not show any signs of apoptosis; their cytoplasm is reach in ribosomes, the numerous 
mitochondria, AER channels and cisterns, as well as elements of the Golgi complex. 
Meantime, some SGTC undergoing fragmentation sometimes show some signs of 
degeneration (Fig. 8b). 



Fig. 8. Ultrastructure of the nuclear fragments, a - nuclear fragments surrounded by narrow 
cytoplasmic territories are separated by double membrane formed by agranular 
endoplasmic reticulum; b - cytoplasm shows signs of degeneration; a nuclear fragment 
shows a transcriptionally active nucleolus; it is isolated from the rest of the giant trophoblast 
cells by a wide channels of agranular endoplasmic reticulum; a bundle of intermediate 
filaments is seen inside the double membrane that delimits the nuclear fragment. 
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Each nuclear fragment is surrounded by the two-membrane NE (Fig.8a) containing 
numerous pore complexes. Progression of fragmentation is accompanied by the chromatin 
condensation and inactivation of the nucleolus. In the large fragments with the low- 
condensed chromatin the nucleolus consists mostly of the fibrillogranular component and 
the fibrillar centers surrounded by the dense fibrillar component, it indicating their 
transcriptional activity. The smallest nuclear fragments indicates, most probably, the final 
step of fragmentation: the chromatin is highly condensed and the nucleoli consist of electron 
dense fibrils that correspond to the most inactivated nucleolus (Zybina, Zybina, 2008). It 
indicates a decrease in attenuation of transcription that results, most probably, in apoptosis 
of the giant trophoblast cell. 

Membranous structures were found to play another role in the transformation of giant 
trophoblast cells (Zybina, Zybina, 2008). At final steps of fragmentation, some nuclear 
fragments were observed to be surrounded by the cytoplasmic area delimited by the double 
membrane (Fig.8a, b). Upon compartmentalization of the cytoplasm, the narrow cisterns 
emerging from the outer membrane of NE pass to the system of the AER channels, which, in 
turn, participates in formation of the double membranes separating the cytoplasmic zones 
around individual nuclear fragments or their aggregates inside the trophoblast 
polykaryocytes (Fig. 8a). Interestingly, the desmosome-like structures appear in flat cisterns 
of AER that isolate the cytoplasmic area between two fragments (Fig. 8b, (Zybina, Zybina, 
2008).). It suggests that the fragments behave, in some aspects, like individual cells. The 
cytoplamic territories that surround the separate nuclear fragments contain the complete set 
of viable cytoplasmic organelles — mitochondria, Golgi complex, GER and AER cisterns, 
numerous polysomes. Signs of degeneration in most cases were not observed in the cell 
compartments surrounding the nuclear fragments. 

Sometimes, the nuclear fragments with the surrounding cell territory appear to be isolated 
from the rest of the cells, in which signs of degeneration are observed (Fig. 8b). These 
fragments look viable: they contain active nucleolus, many ribosomes and mitochondria in 
the narrow layer of the cytoplasm surrounding the nuclear fragment; in this case the 
cytoplasm territory is isolated from the rest of the cytoplasm by wide cisterns of the 
agranular endoplasmic reticulum. 

A question arises, which structures play the role of the centriolar machinery in the course of 
fragmentation of the highly endopolyploid nucleus. Glasser (1984) put forward an idea that 
the centrosome-like bodies consisting of intermediate filaments may play this role. Indeed, 
at the stage of formation of the cytoplasmic zones around the nuclear fragment bundles, the 
6-12-nm intermediate filaments (IF) are observed in the perinuclear zones of the cytoplasm 
(Zybina, Zybina, 2008). 

The decrease of ploidy level is to involve the somatic reduction process, in which genome 
segregation is achieved in the absence of DNA replication. Non-numerous examples of 
somatic reduction are described in the literature. Thus, in ileum of the mosquito Culex, 
nuclei show signs of polyteny. In the course of metamorphosis, polyploid mitoses arise. The 
mitoses are followed by a series of reductional mitoses that proceed without intervening 
interphase and result in formation of octa- and tetraploid nuclei (Berger et ah, 1938). Similar 
divisions are also described in ileum of the mosquito Aedes aegi/pti (Risler, 1959). The well- 
studied example of genome segregation is fragmentation of the highly polyploid nucleus in 
radiolarian (Raikov, 1982) that results in the breakdown of the primary endopolyploid 
nucleus into individual genomes. 
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The above-mentioned processes represent definite steps in differentiation of unicellular and 
invertebrates, they are less pronounced in vertebrates, by coinciding with completion of cell 
cycles. Fragmentation, i.e., somatic reduction of the trophoblast endopolyploid nuclei, also 
represent a definite, specifically terminal, step of the cell lifespan that coincides with 
cessation of cell cycles. These cells, most probably, are incapable of renewal of mitotic 
divisions. Therefore, the biological significance of the phenomenon is probably that 
cessation of functioning of provisory organ, i.e., placenta, makes impossible the renewal of 
proliferative activity of its cells, especially taking into account their characteristic feature of 
invasion of maternal tissues. Nevertheless, the possibility of isolation of diploid nuclear 
fragments with surrounding cell territories anticipates the probability of their 
transformation into full-value cells. Further studies might possibly throw light on 
significance of this phenomenon. 

5. Reversibility of polyploidization in the lifespan of ceils 

The processes described above represent certain steps in differentiation of normal animal 
cells. 

Nevertheless, some pathological processes, including carcinogenesis, imply a possibility of 
progression of polyploidization and depolyploidization cycles, the latter resulting in some 
cases in formation of the actively proliferating cell clones. Investigation of mechanisms of 
depolyploidization showed some traits similar to fragmentation of the highly 
endopolyploid nuclei in the mammalian trophoblast. 

A reserve mechanism that allows overcoming the impossibility of cell division under 
conditions of disorder of the mitotic spindle microtubules was observed in the experiment 
of irradiation and treatment by a microtubule inhibitor SK&F on the p53 mutated Burkitt 
lymphoma cell lines (Erenpreisa et ah, 2002). Electron microscopy showed large cells with 
the lobulating, segmenting, and budding nuclei that gave rise to many micronuclei and/or 
subnuclei. Study of ultrastructure revealed large, endopolyploid cells undergone significant 
changes, among them envelope-limited chromatin sheets (ELCS) and some other unusual 
membranous structures including AL and the so-called confronting cisterns (CC). The 
selective DNA cytometry in these nuclei and their segments showed the most frequently the 
2N multiple DNA content. In irradiated cells, single ALs were seen in the close vicinity to 
ELCS, where they were observed to sequester the cytoplasm around the micronuclei, 
nuclear segments and buds (Erenpreisa, 2002). The process is similar to cytoplasm 
compartmentalization around the trophoblast cell nuclear fragments. However, unlike 
SK&F-treated cells, in trophoblast cells the compartmentalization of the cytoplasm is 
accomplished by cisterns of AER continuous with the outer nuclear membranes. 

In search for a mechanism for genome reduction, acute radiation-induced endopolyploid 
tumor cells were demonstrated to exit from the mitotic cycle and altered DNA repair. It was 
found that the 10 Gy dose of photon irradiation (IR) to p53 function-deficient cell lines 
caused them to undergo extensive changes in the cell cycle progression, so that they 
transiently form endopolyploid giant cells. After G2-arrest for one to two days, the cells 
enter aberrant, often bridged mitoses and then either undergo mitotic death or, through 
"mitotic slippage" or bi-nucleation, become tetraploid. On days 5-6 post irradiation, this 
phase was switched to depolyploidization and bipolar and multipolar cell divisions 
occurred. It seems to be important that Aurora B-kinase was expressed in the newly-formed 
cells. From days 7-9 onwards death of most giant cells can be observed, and cell divisions 
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are seen in the endopolyploid tumor cells. The cells gave rise to small colonies of 
paradiploid cells (Illidge et ah, 2000; Ivanov et al., 2003; Erenpreisa et al, 2005, 2008). It 
seems to be important that Aurora B-kinase was expressed during segregation of these 
newly-formed cells suggesting the mitotic mechanism to be involved in this process 
(Salmina et al, 2010). Interestingly, the p-galactosidase expression changed from positive in 
the giant cells to negative in the small proliferative cells resulted from reduction division 
(Puig et al., 2008; Wheatley, 2008). Therefore, restitutive divisions of large (endopolyploid) 
cells represent a possibility that the tumor can recover post-treatment and start up again 
(Whetley, 2006). 

Upregulation of meiosis-specific genes, such as REC8, MOS, and SPOll, was demonstrated 
in these cells (Kalejs et al, 2006; Ianzini et al, 2009). Besides, signs of pluripotency and self¬ 
renewal stem cell genes NANOG, OCT4, and SOX2 in this polyploidy-dependent survival 
mechanism were found (Erenpreisa, Cragg, 2010). Therefore, polyploidization and 
depolyploidization cycles potentially facilitate survival and propagation of the tumor cell 
population. 

Therefore, the recent findings proved that in some cases polyploidy may provide cells a 
reserve way to survive. The polyploid cells, being more resistant to mutagenic factors and 
mitotic poisons, can be viable for some time and, moreover, may give rise to a viable diploid 
progeny with high proliferative potential. 

6. Conclusion 

Somatic polyploidization characteristic of quite a few tissues in multicellular organisms is 
achieved by means of various modifications of cell cycle, i.e., uncompleted mitoses 
including restitution and acytokinetic mitoses followed by polyploidizing mitoses as well as 
endocycles implying endomitosis and polyteny. These cell cycle modifications are 
accounted for by reduction of different phases of mitosis or by its almost complete absence. 
Nevertheless, if in the absence of mitotic division cycle of DNA, replication is retained, it is 
accompanied by the cyclic changes of transcription activity and chromosome condensation. 
Progression of the modified cell cycles, like in the regular mitotic cycle, is regulated by the 
activity of the cyclin-kinase complexes. However, some specific difference in their 
expression is observed. In particular, in the course of transition of trophoblast cells from 
mitotic cycles to endocycles a change of cyclin D isophorms - from D3 to D1 - was found. 
Arrest of mitotic cycle in transition to endoreduplication cycle is accounted for by the lack of 
assembly of cyclin B/p34 cdkl -complex in the first endocycle, the subsequent endocycles not 
involving at all the cyclin B expression. 

In the course of uncompleted mitoses the majority of mitotic machinery components and 
regulation are retained, which favors retaining their ability to undergo mitoses. Thus, 
Aurora B-kinase, in most cases, proved to retain the mitosis-specific time-table of relocation. 
By contrast, switching off the expression of several factors that ensure the sequence of 
mitosis progression (p34 cdkl , Survivin, etc.) may be accounted for by the lack of kinetochore 
binding to the spindle and other event that result in chromosome missegregation in ana- 
and telophase and polyploidization. Unlike the uncompleted mitoses, endocycles are more 
prone to lose their capability for renewal of mitotic activity. 

In the mammalian placenta, various types of cell cycles are characterstic of the functionally 
different trophoblast cell populations. In this aspect the giant rodent trophoblast cells 
combine lysis and phagocytosis of allogenic maternal cells with their intensive growth and 
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polyploidization (mostly up to 128c-512c) via endoreduplication, i.e. without disappearance 
of the nuclear envelope. It allows ruling out contact of genome to the rests of phagocytosed 
cells that may be mutagenic to the trophoblast cells. By contrast, cells of the placenta 
junctional zone that partly share the traits of the trophoblast stem cells and undergo 
polyploidization via uncompleted mitoses do not contact to the allogenic maternal cells that 
probably favor their genome protection. This example illustrates the significance of different 
ways of genome multiplication for the tissue-specific functions. 

In the rodent giant trophoblast cells, cessation of the DNA replication does not result in the 
complete cessation of the cyclic changes of transcription activity and chromosome 
condensation. It cannot be ruled out that the highest level of "cyclicity" is disintegration of 
polytene chromosomes onto the olygotene chromonemes and/or endochromosomes that 
may be a prerequisite of the whole-genome depolyploidization of the giant nuclei that result 
in the polykaryocyte formation. This event represents the terminal step of giant cell lifespan 
before their apoptosis. However, similar processes described in many pathological processes 
suggest retaining some reserve mechanisms that allow cells to survive under extreme 
conditions. 

In the human placenta, highly invasive extravillous trophoblast cells also undergo several 
endocycles before the start of invasion of endometrium. Transition to endomitosis and 
endoreduplication before the complete cessation of DNA replication favors formation of 
cells more resistant under condition of their contact with allogenic maternal tissues. 
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1. Introduction 

Neurons located in the mammalian central (CNS) and peripheral (PNS) nervous system are, 
for the most part, a stable post-proliferative population. Indeed, recent evidence indicates 
that a vast majority of CNS neurons pushed into the cell cycle will die. Thus, understanding 
the processes of induced DNA replication, neural proliferation and neurogenesis in adult 
individuals is a major goal of modern neurobiology. Neurogenesis is a process that 
produces functional neurons from stem/progenitor cells. It is widespread during 
development, but in the mature mammalian nervous system, under physiological 
conditions it occurs only in two discrete regions: the subventricular zone of the lateral 
ventricle (SVZ) and in the subgranular zone (SGZ) of the dentate gyrus (DG) in the 
hippocampus (Gage 2002; Kempermann et al. 2004). It is believed that in the adult 
mammalian CNS, neurogenesis outside of these two regions is extremely limited or non¬ 
existent. However, some studies of ganglionic nerve centers located outside the CNS 
revealed DNA replication leading to neural cell division and differentiation into neurons, 
presenting the possibility of adult neurogenesis in the PNS (Devor and Govrin-Lippmann 
1985; Czaja et al. 2008; Silva et al. 2008; Gallaher et al. 2011). In this chapter we will present 
background knowledge regarding DNA synthesis leading to production of new neurons in 
the adult mammalian CNS and PNS. 

In the CNS, some external factors, such as brain insults, have been reported to trigger adult 
neurogenesis in regions otherwise considered to be non-neurogenic (Parent 2003). Several 
studies support the hypothesis of insult-induced neurogenesis in the CNS and show that 
during development, damage to the adult nervous system induces factors and mechanisms 
that control neuronal proliferation, migration, differentiation, and connectivity. However, 
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an induction of large-scale neurogenesis that results in addition or replacement of a 
significant number of neurons has not been reported in adult mammalian brain, although it 
does occur in some non-mammalian vertebrates. The presence of neural stem cells in the 
brain offers the potential for neuronal replacement and regeneration after injury. The most 
broadly-studied is ischemia-induced neurogenesis (Parent et al. 2002). However, the 
outcome of ischemia-induced neurogenesis is not robust and does not produce large 
numbers of new neurons. Few of the newborn neurons persist for an extended period of 
time after the insult, and this population is significantly smaller compared with the 
substantial increase in proliferating and migrating neuroblasts. There is also little evidence 
to suggest that those newborn neurons that do survive become functional and integrate into 
the existing circuitry. On the other hand, the response of the CNS after injury is not limited 
to the production of neurons. Induced stem cells and neural progenitors may also 
differentiate into astrocytes and oligodendrocytes. Thus, the increase in CNS progenitor 
cells in response to injury is not limited to neural progenitors, since glial progenitors 
increase as well. Radial glia cells function as glia progenitors and provide a cellular source 
for neurogenesis during development but may also act as neuronal progenitors in the adult 
CNS by dividing and producing astrocytes and neurons. The aim of this chapter also will be 
to review the current knowledge related to injury-induced neural proliferation in the CNS. 
The situation in the PNS may be significantly different than in the CNS. In the PNS, neural 
crest cells, which drive the development of many different nerve structures, also may play a 
role in the post-developmental period. This is because some of the migratory elements 
persist as a peripheral reserve pool of multipotent stem cells that can undergo a late 
differentiation into mature glia or neurons (Lagares et al. 2007). The recruitment of this 
reserve pool may be stimulated by various physiological and pathological conditions in 
which new cells are needed. We will review the current knowledge regarding neural 
proliferation in the adult PNS and discuss the possibility that adult neurogenesis is not 
restricted only to very limited regions of CNS. The alternative to late differentiation of 
multipotent cells may be the process of dedifferentiation in the PNS. This is the process by 
which differentiated neural cells revert to a pluripotent or multipotent state. The process of 
induced reprogramming of cell fate, which is the transforming one cell type into another by 
exogenous factors, might have enormous potential for medical and research advances. 
However, this potential of cell fate reprogramming has not been well explored. Therefore, in 
this chapter we will present the body of knowledge suggesting the possible cellular 
candidates in the PNS for dedifferentiation after injury. The possibility of induced DNA 
replication, neural proliferation and adult neurogenesis in the PNS is almost completely 
unexplored territory. Moreover, the few available studies have resulted in contrasting 
conclusions. However, the PNS has been reported to utilize neural precursors that 
proliferate in vitro and can be induced to differentiate into neurons. The possible existence 
of in vivo damage-induced replacement of neurons destroyed by injury, revealed by several 
investigators in the CNS, is limited to a few reports in the PNS (Czaja et al. 2008; Gallaher et 
al. 2011). Therefore, we will review the current literature dealing with DNA synthesis, 
neural proliferation and neurogenesis in the adult PNS under physiological and 
pathological conditions, and we will discuss possible mechanisms involved in these 
phenomena. 

The main purpose of this chapter is to highlight the niche for further studies on DNA 
replication and to direct this powerful and fascinating discipline toward a very exciting but 
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very challenging nervous system. The extremely high need for studies of DNA replication in 
neural tissue is reflected by Pub Med searches using two different sets of key words. DNA 
together with synthesis generated about 70,000 hits, while by adding nervous system to this 
equation we limited the number to about 400. We believe that the conclusion is obvious. 

2. Injury to the adult CNS induces neural proliferation: hope for self repair 

Despite the fact that neurogenesis was first described in the adult mammalian brain nearly 
100 years ago (Allen 1912), adult neurogenesis has been accepted by the scientific 
community in the last four decades. One year after Ezra Allen's report of neurogenesis in 
the adult rat brain in 1912, Santiago Ramon y Cajal, arguably the most recognizable name in 
neuroscience was quoted as saying, "In the adult centers, the nerve paths are something 
fixed, ended and immutable. Everything may die, nothing may be regenerated." This claim 
effectively suppressed further investigations of adult neurogenesis for 50 years; until the use 
of tritiated thymidine by Joseph Altman in the 1960s (Altman 1962; Altman and Das 1965; 
Altman and Das 1967). Even then, the prevailing belief in the stability of the adult brain was 
upheld for roughly 30 years after these initial reports. During that time, more and more 
sensitive techniques were developed to detect cell division (Nowakowski et al. 1989; 
Nowakowski and Hayes 2001; Wojtowicz and Kee 2006). 

Currently, the idea of adult neurogenesis is widely accepted by the scientific community. 
However, definitive evidence for generation of new and functional neurons in the adult 
uninjured mammalian CNS has been restricted to the subventricular zone (SVZ) of the 
lateral ventricle and to the subgranular zone (SGZ) of the dentate gyrus in the hippocampus 
(Lee and Son 2009; Ehninger and Kempermann 2008; Taupin 2006; Ming and Song 2005; 
Kempermann et al. 2004; Alvarez-Buylla and Garcia-Verdugo 2002). The discovery of adult 
neurogenesis has also uncovered the exciting potential for novel approaches to repair the 
injured brain and spinal cord. Some animals, such as lizards, can regenerate their injured 
nervous system. The mammalian CNS does not have this capacity; however, it does appear 
to have a greater regenerative ability than was previously believed. The traditional idea that 
lineages for neurons and glia become set aside during development has been challenged, 
and it is now well accepted that progenitors for each cell lineage persist into adulthood 
(Gage 2000). Moreover, several studies show that neurogenesis is stimulated in the CNS in 
response to injury (Emery et al. 2003; Parent et al. 2002; Parent and Lowenstein 2002; Parent 
2003; Richardson et al. 2007; Shi et al. 2007). Understanding the developmental process and 
how injury is recapitulating these developmental stages may provide insights into the 
regenerative capacity of the adult nervous system and speed up the development of 
strategies to manipulate this neurogenic potential. 

Following brain injury, factors that stimulate proliferation of adult neural stem cells and 
their respective receptors are upregulated (Bambakidis et al. 2005; Yoshimura et al. 2001). 
Moreover, several studies have suggested that factors and mechanisms regulating neuronal 
proliferation, migration, differentiation and connectivity during development are 
reactivated by injury in the adult nervous system (Zhang et al. 2005; Sun et al. 2007). These 
factors appear to be sufficient to direct neural precursors to migrate toward a lesion area, 
differentiate into neurons, mature and establish synaptic contacts (Mori et al. 2005; Fricker- 
Gates et al. 2002). Transition from stem cells to fully differentiated neurons progresses 
through defined steps, and different classes of neuronal precursors can be distinguished by 
their morphology, expressed markers, and mitotic activity (Encinas and Enikolopov 2008). 
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Moreover, several lines of evidence suggest that glia participate in injury-induced 
neurogenesis, suggesting that they serve as both stem cells and progenitors from which 
neuroblasts develop (Englund et al. 2005; Gubert et al. 2008; Alonso et al. 2008; Alvarez- 
Buylla and Lim 2004; Hevner 2006). During CNS development, both the radial glia and 
astrocytes were reported to undergo proliferation and differentiation toward both the 
neurons and glia (Costa et al. 2009). Moreover, behaving as post-mitotic cells under 
physiological conditions, following injury glia may undergo phenotypic and functional 
changes and dedifferentiate. It has been previously shown that after injury, quiescent 
astrocytes reveal features of immature stages or phenotypes of stem cells and proliferate 
(Buffo et al. 2008). These proliferating astrocytes however, undergo symmetric divisions and 
remain within their cell lineage in vivo, while an altered in vitro environment revealed their 
self-renewal and multipotency. 

Based on the current knowledge, the most probable scenario in post-injury brain repair is 
recruitment of endogenous progenitors to replace lost neurons. Proliferation of neural 
progenitors in germinal centers can be induced by delivery of trophic factors, such as FGF-2 
(Yoshimura et al. 2003; Yoshimura et al. 2001). Recruitment of endogenous progenitors has 
also been achieved through release of brain-derived neurotrophic factor (BDNF) and 
induction of neural differentiation (Chmielnicki et al. 2004; Benraiss et al. 2001). The in vivo 
induction of massive proliferation, directed migration, and differentiation of neural cells in 
the forebrain of the adult rat after lesion of the substantia nigra were observed after 
infusions of transforming growth factor alpha into the forebrain (Fallon et al. 2000). Studies 
of injury-induced neurogenesis in CNS generate substantial enthusiasm for the recruitment 
of endogenous progenitors as the therapeutic strategy for CNS repair. However, the 
successful development of stem cell recruitment therapy depends on the ability to control 
proliferation, differentiation, and functional integration of induced pluripotent cells. In 
order to develop therapies that can recruit endogenous progenitors to repair damaged areas 
of the adult CNS, it is also important to understand how neural environment changes 
following injury. 

While injury-induced neurogenesis in mammals does not lead to complete recovery, we 
believe it represents the brain's attempt to repair itself. However, in adulthood, CNS has 
been associated with a very limited regenerative response to injury, compared to embryonic 
CNS (Fry and Saunders 2000; Nicholls and Saunders 1996; Fawcett 1992). Therefore, an 
understanding how to induce and control self repair process in the brain to promote 
recovery from injury or neurodegenerative diseases would be the main challenge in 
regenerative medicine of the 21 st century. Reviewed in subsequent sections of this chapter is 
recent evidence that after injury, the CNS in adults activates cascades of events and gene 
expressions that are normally observed during development. We will discuss studies 
showing that under normal circumstances, damage to the CNS, including stroke and 
traumatic brain injury (TBI), as well as the experimental use of neurotoxins and other 
external agents in both the brain and the spinal cord result in an increased cell proliferation 
and neurogenesis in quiescent regions of the CNS. 

2.1 Stroke-induced neural proliferation 

Stroke, or cerebrovascular accident, results from the middle cerebral artery occlusion 
(MCAO) leading to brain infarction, is one of the common causes of death and disability in 
adulthood. However, improvement of brain functions has been reported following stroke. 
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implying the ability for brain repair (Pascual-Leone et al. 2005). One means by which repair 
might be achieved is neurogenesis, which occurs in the normal adult brain, and is increased 
in animal models of brain injury including stroke (Katakowski et al. 2007; Jin et al. 2006; 
Zhang et al. 2004). Post-stroke neurogenesis is associated with migration of newborn 
neurons from neurogenic niches to injured regions of the brain (Arvidsson et al. 2002). 
Neuronal precursors proliferate in the SVZ, and newly formed neurons migrate into the 
ischemic lesions and differentiate to acquire the phenotype of the neurons which were lost. 
However, the outcome of ischemia-induced neurogenesis is not robust and does not 
produce large numbers of new neurons. Few of the newborn neurons survive for an 
extended period of time after the insult, and this population is significantly smaller when 
compared to the population of migrating neuroblasts. There is also little evidence to suggest 
that those newborn neurons that do survive become functional and integrate into the CNS 
circuitry. 

On the other hand, the response of the CNS after injury is not limited to the production of 
neurons. Induced pluripotent cells may also differentiate into glia. Radial glia function as 
progenitors and provide a cellular source for neurogenesis during development and may 
also behave as neural progenitors in the adult CNS by proliferating and generating 
astrocytes and neurons. Stroke induced by MCAO in adult rodents triggers increased 
neurogenesis in the SGZ and SVZ (Abrahams et al. 2004; Jin et al. 2006; Katakowski et al. 
2007; Liu et al. 2007). The SVZ neuroblasts migrate into the damaged striatal areas and 
adopt the phenotypes of the projection neurons. Previously published studies show that 
stroke increases the production of neuroblasts in the SVZ, which express doublecortin and 
polysialylated neural cell adhesion molecule (Arvidsson et al. 2002; Parent et al. 2002); 
however, few cells survive to become functional neurons, and this percentage is low 
compared with the substantial increase in proliferating and migrating neuroblasts 
(Arvidsson et al. 2002; Parent et al. 2002). It has been also shown that after MCAO, the 
population of BrdU-positive cells increases in the SVZ, reaching the maximum around a 
week after ischemia (Arvidsson et al. 2002; Parent et al. 2002). Next, the population of 
doublecortin-expressing neuroblasts migrates outside the SVZ, as observed during their 
normal migration along the rostral migratory stream. Stroke-activated neuroblasts may also 
migrate into the injured striatum (Arvidsson et al. 2002; Parent et al. 2002). These cells 
become striatal projection neurons and express calbindin, markers of striatal medium spiny 
neurons (Arvidsson et al. 2002; Parent et al. 2002). Several studies show that cortical lesions 
produced by MCAO, significantly increase SVZ cell proliferation (Ohab et al. 2006; Gotts 
and Chesselet 2005). After MCAO cell proliferation also increases in the SGZ, even though 
this area is not directly affected by the stroke, resulting in the production of new neurons in 
granular zone of the dentate gyrus (Jin et al. 2001). Together, these studies indicate that 
mitotically active cells, and not mature cells proliferating in response to injury, comprise the 
majority of proliferating populations after brain ischemia (Jin et al. 2001). 

2.2 Traumatic brain injury and neural plasticity 

Traumatic brain injury (TBI) is the most common brain injury in humans (Graham et al. 
2000). After such injuries, the brain shows a remarkable ability for functional recovery 
(Nakabayashi et al. 2007). While, mechanisms responsible for this recovery are mostly 
unknown, the vast majority of neuroscientists hypothesize that synaptic plasticity is 
responsible for adaptive changes in the CNS following the injury (Zhang et al. 2011; Falo et 
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al. 2008; Scheff et al. 2005). In the last decades, there has been increasing interest in the 
ability to provide a continuous renewal of neurons in the adult CNS. This increased interest 
popularized the alternative mechanism for a recovery following TBI: the injury-induced 
neurogenesis. In CNS injury in general and TBI in particular, there is little data to suggest 
how damage to the nervous system may induce the proliferating population to differentiate 
into functional neurons. The neural environment changes associated with CNS injury are 
initially related to excitotoxicity associated with glutamate release (Hinzman et al. 2010; 
Gong et al. 1999; Globus et al. 1995). This is followed by an intense inflammatory response 
due to activated microglia, macrophages and trophic factors (Khuman et al. 2011; Hellewell 
et al. 2010). Secondary neuronal damage involves delayed cell death by apoptosis (Kim et al. 
2010; Dressier and Vemuganti 2009), and then astrocytes form a scar that acts as a barrier to 
axonal regrowth. 

While processes of synaptic plasticity have been well documented over the last decades, 
very few experiments have directly addressed the role of newly generated cells in the post- 
TBI remodeling of the CNS. To test this hypothesis, several groups studied the effects of 
experimental TBI on hippocampal neurogenesis (Emery et al. 2005; Lu et al. 2011; Kernie 
and Parent 2010). They detected significant increase in the number of dividing cells in 
dentate gyrus. Moreover, a significant increase in the number of new cells expressing 
neuronal markers after appropriate maturation periods has been reported (Sun et al. 2007; 
Sun et al. 2005). This fact suggests that injury-induced DNA replication and cell proliferation 
in the adult CNS is primarily neurogenic. 

2.3 Neural proliferation after spinal cord and brainstem injury 

Spinal cord injury research has greatly expanded in recent years. However, understanding 
of the mechanisms triggering the functional recovery following the neuronal damage is still 
incomplete. After spinal cord injury (SCI), tissue damage occurs at the impact site and 
spreads over time (Tederko et al. 2009). This lesion is accompanied by apoptosis of 
oligodendrocytes and loss of myelin around surviving axons (Wu and Ren 2009). Axonal 
demyelinization typically peaks during the few days after injury (Wu and Ren 2009; Salehi 
et al. 2009). Remyelinization by oligodendrocytes and Schwann cells usually begins by a few 
weeks after damage to the spinal cord (Sasaki et al. 2007; Dasari et al. 2007). Several studies, 
however, suggest that mature oligodendrocytes do not proliferate in response to trauma 
(Keirstead and Blakemore 1997; Amat et al. 1998; Redwine and Armstrong 1998). However, 
more recent studies show that neurogenesis can be induced in the spinal cord following 
specific types of injury. A substantial gliosis was previously reported in the lesion area, 
which resulted in the formation of a glial scar (Fitch and Silver 2008). However, generation 
of new and functional neurons was not observed after hemisection or lesion of dorsal 
column (Vessal et al. 2007; Yang et al. 2006). They revealed significant gliogenesis and 
reported that new born oligodendrocytes remyelinated host axons in the injury site. 
Therefore, new born glia may contribute to the neural repair following the damage to the 
spinal cord. 

Studies of neural proliferation in the adult brainstem revealed that production of new 
neurons occurs in vivo within the dorsal vagal complex (DVC) in adult rats (Bauer et al. 
2005). They detected within the DVC newly generated BrdU-positive cells which 
simultaneously expressed markers of immature and mature neurons (DCX, HuC/D, TUC-4, 
NeuN). It has been also reported that vagotomy (DVC deafferentation) triggered a large 
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increase of BrdU incorporation in the ipsilateral DVC, associated with proliferation of 
microglia and genesis of neurons and astrocytes (Bauer et al. 2005). Injury-induced 
plasticity, neural proliferation and neuronal replacement were recently revealed after 
damage to pre-Botzinger Complex, a cluster of interneurons in the ventrolateral medulla of 
the brainstem (Neumueller et al. 2011). Neuronal counts and evaluation of neuronal density 
revealed up to 65% more neurons within the ventral and lateral medulla compared to 
control animals. Concluding studies dealing with neural proliferation in the brainstem we 
can hypothesize that injury activates plasticity and reorganization of brainstem circuits, late 
differentiation and even neurogenesis. However, more rigorous studies are necessary to 
definitely prove injury-induced neurogenesis in the adult brainstem. 

3. Neural proliferation in the adult PNS: challenging the dogma 

Whereas in the CNS the occurrence of DNA replication and neural proliferation in adult 
mammals has been broadly accepted from the end of the twentieth century (Johansson et al. 
1999), in the PNS it was already documented, or at least hypothesized much earlier. The 
occurrence of adult neurogenesis in the PNS was suggested by Hatai several decades earlier 
(Hatai 1902). He reported an increase in sensory neuron counts in DRGs from one-month to 
five-month-old rats. Later, Miura provided the first evidence of an increase in neuron 
numbers in the autonomic nervous system, namely in Auerbach's plexus of the rat small 
intestine (Miura 1913). A conformation of the possibility that neurogenesis occurs in the 
autonomic nervous system was later provided by Benninghoff, who showed that partial 
stenosis of rat small intestine induced, in the upstream intestinal loops, an increase in 
Auerbach's plexus ganglion neuron numbers (Benninghoff 1951). Moreover, Filogamo and 
coworkers demonstrated that in the same experimental model, neurogenesis led to a 
fourfold increase in the number of Auerbach's plexus ganglion neurons of the loops 
upstream from the intestinal obstruction (Filogamo and Vigliani 1954). Yet, the occurrence 
of adult neurogenesis was revealed in other sites of the autonomic nervous system including 
DRGs in the rat (Devor and Govrin-Lippmann 1985; Devor 1991; Popken and Farel 1997). 
Studies of neural proliferation in the adult PNS were previously reviewed by Geuna (Geuna 
et al. 2002). 

It should be pointed out that the above-mentioned studies, that were based on neuron 
counts led authors to the conclusion that, since there was no evidence of mitosis by nerve 
cells detectable by colchicine blocking, the observed increase in neuron number should be 
attributed to the persistence of poorly differentiated or undifferentiated cells capable of 
turning into neurons during adulthood and/or under the influence of exceptional stimuli. 
The introduction of the techniques for investigating DNA replication, especially 3H- 
thymidine (and afterwards BrdU) administration, cytophotometry/cytofluorimetry and S- 
phase markers (e.g. PCNA and Ki67), opened new horizons for interpreting the adult 
neuron addition. In a series of studies, Giacobini-Robecchi and coworkers showed the 
presence of DNA replication (by autoradiography after 3H-thymidine administration and 
PCNA immuno-staining) in myenteric neurons from the small intestine loops upstream 
from a partial stenosis (Corvetti et al. 2001; Poncino et al. 1990; Giacobini Robecchi et al. 
1988). However, cytophotometry after Feulgen staining showed that myenteric neuron DNA 
replication might not be due to a tetraploid DNA content related to cell division, but instead 
to unscheduled DNA synthesis that leads to a hyper diploid DNA content (Poncino et al. 
1990; Giacobini Robecchi et al. 1988). Electrophoretic analysis of total genomic DNA has also 
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suggested that unscheduled DNA synthesis can be due to DNA amplification (Giacobini 
Robecchi et al. 1995). 

The evidence of DNA replication in the adult sensory ganglia has been also provided in 
Ciaroni's and Czaja's laboratories (Cecchini et al. 1995; Ciaroni et al. 2000; Czaja et al. 2008; 
Gallaher et al. 2011; Ryu et al. 2010). However, it should be noted that the occurrence of 
adult neurogenesis in the PNS has been questioned by several experimental studies that 
failed to confirm the numerical increase of adult neurons in both the myenteric plexus 
(Gabella 1984) and DRGs (La Forte et al. 1991; Pover et al. 1994; Bergman and Ulfhake 1998). 
In this view, cell counting has raised a lively debate over the last years in the neuroscience 
field, and it appears that in many cases the discrepancy in the results might be due to bias in 
the counting methods used (Popken and Farel 1997; Geuna 2005). Studies based on unbiased 
stereological neuronal counts have confirmed again the occurrence of neuron number 
increases in adult sensory ganglia (Lagares et al. 2007), thus raising a question about the 
origin of the new neurons. Two possibilities have been tentatively proposed to explain the 
increase in the number of neurons during adulthood: 1) de novo neurogenesis (i.e., based on 
DNA replication and cell division); 2) neuron addition (i.e., late differentiation of poorly 
differentiated neuronal precursors without cell division) (Farel 2001; Farel 2002; Geuna et al. 
2000). Although the detection of DNA replication (Ciaroni et al. 2000; Farel 2001; Farel 2002; 
Geuna et al. 2000; Giacobini Robecchi et al. 1988) is considered a predictor of cell division 
and thus de novo neurogenesis, it might also be due to selective amplification of a few DNA 
sequences (Corvetti et al. 2001; Giacobini Robecchi et al. 1995). On the other hand, the 
possibility that the neuron number increase is due to the late differentiation of slightly 
immature neurons is partially ruled out by the difficulty in identifying (even with careful 
electron microscope investigation) the immature neurons in adult DRGs (Geuna, 
unpublished). Therefore, a third tentative explanation, which falls between the other two 
options, is being sought, namely the possibility that a neuronal precursor niche from neural 
crest origin persists all along adulthood in the PNS (Gallaher et al. 2011; Lagares et al. 2007). 
Although, sensory ganglia have been reported to contain precursor cells (Arora et al. 2007; 
Lagares et al. 2007; Li et al. 2007) that can proliferate in vitro and can be induced to 
differentiate into neurons (Liu et al. 2009; Namaka et al. 2001), the possibility of in vivo adult 
neurogenesis in the adult PNS is almost completely unexplored. Moreover, the few 
published studies have resulted in contradictory conclusions. For example, age-related 
increases in the number of dorsal root ganglia (DRG) neurons have been reported in rat by 
two groups (Popken and Farel 1997; Ciaroni et al. 2000), while other investigators have 
provided data suggesting that adult neurogenesis does not occur in the adult DRG (Pover et 
al. 1994; Mohammed and Santer 2001). Thus, there is as yet no consensus regarding the 
neurogenic potential of the adult PNS sensory ganglia. 

3.1 Injury-induced neurogenesis in sensory ganglia 

A strong evidence for the potential of PNS cells to produce new neurons arises from Czaja's 
laboratory. They have examined the sequels to toxin-induced neuronal destruction. Thus, 
evidence of neuronal replacement following lesions, which has been reported by several 
investigators in CNS models, (Hou et al. 2008; Lie et al. 2004; Parent 2003), is limited to our 
recent reports for the PNS (Czaja et al. 2008; Gallaher et al. 2011). Our key observation is that 
neuronal losses following capsaicin-induced neuronal destruction are not permanent. In 
fact, following a precipitous post-capsaicin decline, the numbers of neuronal nuclei in 
nodose ganglion (NG) from capsaicin-treated rats equaled or exceeded the numbers found 
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in control rats by 60 days after capsaicin treatment. Furthermore, we observed that 
restoration of neuronal numbers after capsaicin was accompanied by BrdU incorporation, 
which ultimately labeled mature neurons (Czaja et al. 2008). Our observation of apparent 
neuronal number restoration in NG led us to hypothesize that damage to the primary 
sensory neurons induces proliferation of endogenous progenitors and/or lineage 
reprogramming and generation of induced multipotent stem cells (iMSCs), which 
differentiate into new neurons. The satellite cell population housed in DRG or NG may, 
under adequate conditions, represent a source of multipotent cells for the new neurons (Li et 
al. 2007; Singh et al. 2009; Gallaher et al. 2011). Moreover, morphological observations have 
recently provided further support to this hypothesis and revealed that NG cultures from 
capsaicin-treated rats contained bipolar neurons, normally found only during development 
(Gallaher et al. 2011). Furthermore, we observed that restoration of neuronal numbers after 
capsaicin was accompanied by BrdU incorporation that ultimately labeled mature neurons 
(Czaja et al. 2008; Gallaher et al. 2011). Our observation of apparent restoration of neuronal 
numbers in NG led us to hypothesize that damage to the primary sensory neurons induces 
proliferation of endogenous progenitors and/or lineage reprogramming and generation of 
induced multipotent stem cells (iMSCs) which differentiate into new neurons. The existence 
of ganglionic MSCs would suggest that PNS could serve as a source of committed 
autologous cells that are capable of being stimulated to produce neurons in vivo, thus 
avoiding an immune rejection. The discovery of neural progenitors and mechanisms 
involved in the induced neurogenesis in the adult PNS could enable autologous grafting of 
new neurons into damaged areas of the CNS to replace neurons lost due to injury or disease. 

4. Mechanisms involved in post-injury DNA replication and cell proliferation 

In the brain, new neurons are formed from neural stem cells (NSCs), which are multipotent 
and self renewing. Neural stem cells can undergo symmetric or non-neurogenic divisions 
that produce two neuro-epithelial stem cells and cause horizontal expansion of the 
proliferative population (Gotz and Huttner 2005). Alternatively, they engage in asymmetric 
divisions that are associated with the generation of neuronal progenitors (Falk et al. 2008). 
The DNA synthesis and proliferation of NSCs are critical processes which regulate the size 
and neurogenic potential of the nervous system. Multiple signaling pathways have been 
implicated in the regulation of neural stem and progenitor-cell proliferation (Riccio 2010; 
Suh et al. 2009). Signalling pathways controlling NSCs niche and signaling after brain injury, 
outside NSCs niche share a significant overlap converging on two canonical pathways, 
mammalian target of rapamycin (mTOR) and mitogen-activated protein kinase (MAPK) 
(Robel et al. 2011). 

One of the key pathways regulating cell proliferation in the nervous system is WNT 
(wingless-type MMTV integration site family)/ p-catenin signaling (Wexler et al. 2009) 
WNT signaling regulates neural plasticity during development and in the adult nervous 
system. During development, p-catenin signaling regulates proliferation of neuronal stem 
cells, leading to the correct number of neurons in the developing embryo. WNT also 
regulates proliferation and preserves pluripotent and multipotent state of NSC and neural 
progenitors in the adult brain by keeping them in a dividing stage that prevents 
differentiation (White et al. 2010). WNT signaling operates through autocrine signaling loop 
among NSCs. NSCs express WNTs ligands and WNT receptor (Wexler et al. 2009).WNTs 
are glycoprotein morphogens that interact with G-protein-coupled receptors and initiate 
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different signaling pathways, including the canonical pathway involving p-catenin. Binding 
of WNT to the receptor triggers the p-catenin-LEF/TCF signal transduction and activation 
of a WNT responsive target gene, such as cycline Dl, which promotes cellular proliferation. 
Over-expression of WNTs or persistent activation of the downstream signaling component 
b-catenin in the CNS promotes cell-cycle progression and negatively regulates cell-cycle exit, 
resulting in horizontal expansion of precursor populations (Chenn and Walsh 2002). 

4.1 Immune system and regulation of neurogenesis 

Brain injury induces an excess of factors that modulate plasticity of various cells and induce 
neurogenic potential within damaged tissue. Several lines of evidence indicate that various 
components of the immune system may positively regulate neurogenic processes (Yirmiya 
and Goshen 2011; Ziv and Schwartz 2008) . Brain damage activates microglia, which rapidly 
migrate to the injury site and initiate recruitment and activation of macrophages and 
lymphocytes. These cells release a number of anti- and pro-inflammatory factors creating a 
positive feedback loop that results in neural damage and causes both detrimental and 
positive consequences to neurogenesis. It has been demonstrated that upon severe injuries 
such as hypoxia or stroke, a portion of microglia proliferate. Furthermore, microglia also 
reveal in vitro stem cell potential with self-renewal and multipotency (Robel et al. 2011). 
Several in vitro and in vivo studies have demonstrated that lymphocytes, specifically T cells 
may promote neurogenesis. T-cells derived cytokines IL-4 or INF-y activate microglia cells 
(Ziv and Schwartz 2008). Interestingly, WNT- and p-catenin signal increases in adult 
proliferating astrocytes and in NG2 glia after traumatic brain injury, suggesting that WNT 
signaling plays a role in the control of gliogenesis following cortical injury (White et al. 
2010). Growth factors including FGF2, EGF and VEGF are released by various cell types 
after brain injury and activate mTOR or mitogen-activated protein kinase (MAP) pathways. 
These growth factors also activate ERK and Jun N-terminal kinase (JNK)-dependent MAPK 
pathways in astrocytes after injury. These signaling pathways act synergistically to 
phosphorylate the immediate early gene products c-FOS and c-JUN which increase 
proliferation of astrocytes. Phosphorylation of ERK is specifically detected in reactive 
astrocytes after injury (Robel et al. 2011). 

The activated microglia seemed to exert the neurogenic effect via expression of the anti- 
inflammatory/neuroprotective cytokine transforming growth factor-beta (TGF-pi), which 
has a (concentration-dependent) neurogenic effect by acting on the proliferation of neural 
progenitors (Battista et al. 2006; Ziv and Schwartz 2008). TGF-pi decreases the expansion of 
neural stem and precursor cells in a dose-dependent manner (Aigner and Bogdahn 2008). 
While WNT positively controls expansion of neural stem and progenitor pools, 
transforming growth factor p (TGF-P) upregulates cell cycle inhibitors and counteracts cell 
cycle progression. Therefore TGF-p signaling controls the size of specific brain areas by 
antagonizing canonical WNT signaling and negatively regulating self-renewal of 
neuroepithelial stem cells. TGF-p has been associated with neuronal differentiation and 
survival in CNS under physiological conditions (Falk et al. 2008). Post-injury cell renewal, 
such as precursor cell proliferation in CNS, is positively regulated by microglia-derived 
TGF-p (Ziv and Schwartz 2008). Three isoforms of TGF-p (TGF-P 1, 2, 3) are expressed in 
neurons and in glia cells. TGF-pi isoform is predominantly upregulated and activated after 
CNS lesions or, in cases of neurodegeneration, TGF-p signals through activation of TGF-p 
receptor type I and II (Tgfbrl and Tgfbr2) and phosphorylation of the signaling mediators 
Smad2 and Smad3 (Aigner and Bogdahn 2008). TGF-p not only activates the Smad signaling 
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Fig. 1. Mechanism involved in post-injury cell proliferation in CNS 

Injury to the brain triggers complex responses between populations of immune cells 
(yellow circle), microglia, neural stem cells (NSC), neural progenitors and differentiated 
neurons (all shown as blue circle). The NSC and neural progenitors represent the main 
precursors of neurons. Two antagonistic signalling pathways WNT and TGF-p regulate 
proliferation and differentiation in adult neurogenesis. While WNT positively control 
expansion of neural stem and progenitor pools, TGF-p counteracts cell cycle progression 
and is associated with neuronal differentiation and survival in CNS. Injury-activated 
microglia emerges as a new potential source of NSC and neural progenitors. T-cells may 
promote neurogenesis by interacting with microglia. T-cells derived cytokines (IL-4 or 
INF-y) activate microglia cells. The activated microglia seemed to exert the neurogenic 
effect via expression of the anti-inflammatory/neuroprotective cytokine TGF-pi. Growth 
factors including FGF2, EGF and VEGF are released by various cell types after injury, 
bind to receptors (blue bars) expressed at the surface of activated microglia cells and 
initiate intracellular signal transduction and activate mTOR, ERK and MAPK pathways 
in astrocytes. These signalling pathways act synergistically to increase proliferation of 
astrocytes and they are also involved in up regulation of the neurogenic genes (Pax6, 
NEROG2, MASH1). WNT- and p-catenin signalling also increases in adult proliferating 
astrocytes after traumatic brain injury. Glia cells were successfully reprogrammed to NSC 
or neural progenitors in vitro by induced over expression of neurogenic genes. Adult, 
injury-induced cellular proliferation during neurogenesis is regulated by cytokines (TNF- 
a, LIF) and morphogens WNTs. Signalling from these molecules has a common target, 
cycline-Dl which triggers cell cycle entry and DNA synthesis. 
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pathway, but it may diverge into and modulate other known classical signaling cascades 
such as RAS/ERK/MAPK. Activation of RAS/ERK/MAPK signaling by TGF-p can induce 
an autocrine amplification loop of TGF-pi expression. It has been reported that adult rat 
dorsal root ganglion (DRG) neurons after injury are able to synthesize TGF p-1 isoform, 
which is able to elicit Schwann cell proliferation and modulates regeneration processes in 
the PNS (Rogister et al. 1993). 

Immune responses modulate the NSCs. Proliferation of NSCs is regulated by cytokines and 
immuno-modulating polypeptides (Gonzalez-Perez 2010). TNF-a is one of the crucial 
inflammation mediators. In the CNS, TNF-a binds to TNF receptors expressed by glia cells 
and neurons. Pro-inflammatory cytokine TNF-a has been associated with the pro- 
neurogenic role by increasing proliferation of NSCs through activation of NFkB and TGF-p 
activated kinase-1 (TAK-1) signaling cascade. NFkB transcriptionally regulates cyclin-Dl in 
NSCs (Fig.l). The cyclin-Dl/CDK4-complex is necessary for NSC cell cycle progression by 
promoting passage through Gl/S restriction point (Widera et al. 2006). Injury to the 
peripheral nervous system results in the upregulation of TNF-a. TNF-a is released from 
neurons, Schwann cells and proliferating satellite cells in DRG. The role of the TNF-a in the 
injury of peripheral nervous system is not well understood; however it may be an important 
inducer of the proliferative potential of cells. 

Leukemia inhibitory factor (LIF) is another cytokine that displays pro-neurogenic effects in 
the injured adult brain. LIF has been demonstrated as a key signal for injury-induced 
neurogenesis in adult mouse olfactory epithelium (Bauer and Patterson 2006). LIF appears 
to promote NSC self-renewal and expansion. LIF stimulates the initiation of DNA synthesis 
and cell division through a common signaling mechanism that involves MEK/ERK 
activation as well as STAT1 cytoplasmic nuclear translocation. LIF acts on NSC and 
maintains them in a loop of symmetric/self-renewing divisions in vivo (Bauer and Patterson 
2006). LIF stimulates DNA synthesis through MEK/ERK; this is a direct mechanistic 
molecular link between tissue damage inflammation and tissue renewal. The ERK cascade is 
activated by hormones, cytokines, and growth factors that result in either proliferation or 
growth arrest, depending on the duration and intensity of the ERK activation. This pathway, 
consisting of Raf, MEK1/2, and ERK1/2, regulates cell proliferation via its impact on cell 
cycle control. Activation of this mitogen-activated protein kinase pathway promotes the 
expression of cyclin-Dl. These key events result in the activation of the cyclin-dependent 
protein kinase CDK4/6, which promote cell cycle entry by phosphorylation of the 
retinoblastoma tumor suppressor (Rb), leading to the release of the transcription factor E2F. 
This in turn promotes the transcription of cyclins A and E, resulting in the activation of 
CDK2. Activated CDK2 phosphorylates Rb at additional sites and, thus, enables DNA 
synthesis and centrosome duplication via further liberation of E2F. Furthermore, sustained 
expression of p21cipl has been shown to be responsible for the ERK-mediated proliferation 
inhibition (Ussar and Voss 2004). 

4.2 Mechanisms involved in the neurogenic potential of glia 

Brain studies provide increasing evidence that adult glia could be a source of new neurons 
(Robel et al. 2011). Glia in the adult brain may act as neural progenitors and neural stem 
cells. Following lesion-induced reactivation, astrocytes dedifferentiate to astrocyte-like stem 
cells. Several studies suggest that the WNT/(3-catenin pathway may be involved in the 
dedifferentiation process following injury. Interestingly, injury-induced WNT/b-catenin 



Injury-Induced DNA Replication and 

Neural Proliferation in the Adult Mammalian Nervous System 


271 


signaling has already been shown to be required for the dedifferention of other cells such as 
epithelial cells to form hair follicles and Mueller glia to form retinal cells (Osakada et al. 
2007). Remarkably, WNT signaling has been associated with peripheral nerve injury. 
Peripheral nerve lesions induce upregulation of the WNT signaling mediator Ryk on DRG 
neurons, which may potentially indicate a role of WNT in neuroregeneration and/or 
neurogenesis (Li et al. 2008). 

Pax6 neurogenic transcription factor plays a crucial role in neurogenesis, both during 
development and in adulthood. It is expressed by NSC. Astrocytes forced to express Pax6 
acquired neuronal morphology. PAX6 and other neurogenic factors such as NEUROG2, 
MASH1 signaling can cause a true glia-to-neuron conversion in vitro. NEUROG2 is 
epigenetically silenced in postnatal astrocytes. Upregulation of neurogenic genes could be 
induced following injury in proliferating reactive astrocytes, which would acquire potential 
to dedifferentiate and be reprogrammed to neurons (Robel et al. 2011). Given that the 
retroviral vectors used in these studies incorporate only in proliferating cells and that 
neurons are post mitotic, it seems that the origin of these new neurons is the glia whose 
proliferation is enhanced after injury. Interestingly, some signals promoting astrocytes 
dedifferentiation may derive from injured astrocytes themselves, as suggested by scratch 
wound-injured rat spinal cord astrocytes (Robel et al. 2011). 

Sox2 is a member of the SRY-related group of transcription factors that plays a significant 
role during neural cell development. Sox2 maintains the pluripotency and developmental 
potential of neural stem cells (Li et al. 1998). It has been shown that expression of Sox2 and 
the transcription factors Oct4, c-Myc and Klf4 is sufficient to reprogram adult fibroblasts to 
generate pluripotent stem cells (Chang et al. 2010; Park et al. 2008). The constitutive 
expression of Sox2 has been shown to inhibit neuronal differentiation, while suppression of 
Sox2 leads to cell cycle exit and neuronal differentiation (Graham et al. 2003). Sox2 
expression is also upregulated following injury to the peripheral nerves. It has been shown 
that Sox2 expression was elevated in glia cells in Krox20 knockout mice (Le et al. 2005). 
These results indicate that Sox2 is suppressed, either directly or indirectly, by Krox20. 
Krox20 and Sox2, like Krox20 and c-Jun, inhibit each other. It has been suggested that c-Jun 
controls Sox2 levels and that some of the inhibitory effects of c-Jun are channeled by Sox2 
(Parkinson et al. 2008). Other studies indicate that Sox2 acts downstream of Notch 
(Woodhoo et al. 2009). Sox2 expression is essential to specify glia cell fate from neural crest 
cells. 

4.3 The molecular events involved in damage-induced DNA replication and cellular 
proliferation in PNS 

The neurogenic potential of PNS has been recently reported (Gallaher et al. 2011). Formation 
of new neurons in the adult NG was followed by capsaicin-induced neuronal damage and 
cell death via apoptosis. Cell proliferation was detected by BrdU incorporation during DNA 
synthesis, which is a standard marker of dividing cells. The molecular events involved in 
damage-induced DNA replication and cellular proliferation in PNS are not well understood. 
However, by analogy to damage-induced neurogenesis in CNS, components of the immune 
system may play important regulatory roles. This will be under stress and tissue damage 
condition where components of immune system such as pro-inflammatory cytokines may 
play a role of molecular triggers affecting cell cycle of differentiated neurons. 
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Fig. 2. Post-injury degeneration and regeneration in sensory ganglia 

The adult sensory ganglia comprises TRPV-1 expressing (A: green) and non-TRPV-1 
expressing neurons (A: blue). Following injury, many neurons will completely 
degenerate (B: cell body with fragmented nucleus) while others only lose peripheral 
projections (B: red nucleus). Injury to the NG and loss of neurons will result in the active 
response from neurons, glia and immune system through upregulation of various 
cytokines, growth factors and morphogens. These factors may induce DNA synthesis 
revealed by BrdU incorporation (C, D: yellow) and cell proliferation (D: blue arrows). 
This cascade of events may change the ganglionic environment and induce 
dedifferentiation (D: red arrows) of satellite glia cells (SGC), Schwann cells (SchC), glia 
progenitors (GPC) and neuronal progenitors (NPC) to less-differentiated stage within 
their own lineage. Dedifferentiated cells may next differentiate (D: green arrows) or 
transdifferentiate (D: purple arrow) to switch the cell lineage. The mechanisms of 
differentiation, dedifferentiation and transdifferentiation may be responsible for injury- 
induced neuronal replacement and generation of sensory neurons (SN) in the ganglia. 


It has been well known that adult sensory ganglia are composed of terminally differentiated 
neurons and glia cells. However, recent studies show that the pool of immature glial 
precursor cells could be present in the adult sensory ganglia (Gallaher et al. 2011; Lagares et 
al. 2007). Injury to sensory ganglia triggers an inflammatory response and bursts of pro- 
inflammatory cytokines, growth factors, and morphogens, which are known to exert their 
effects through common signaling pathways and modulate neuronal plasticity. Specifically, 
TNF-a, TGF-p, and WNT have been activated after a peripheral nerve lesion (Li et al. 2008; 
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Saade et al. 2002). The MAP Kinases pathway (MAPK/ERK/p38) also has been reported to 
activate in peripheral nerve injury (Agthong et al. 2006). In the sensory ganglia injury 
initially damages peripheral axons and depending on the damaging factor, some neurons 
will die and some will survive and regenerate (Fig. 2A-C). Moreover, the cascade of 
degenerative events may induce molecular signaling triggering proliferation, differentiation 
and dedifferentiation of ganglion cells, which provide a source of neural progenitors or 
induced multipotent cells for new neurons (Fig. 2D). 

The identity and function of differentiated, post-mitotic cell is no longer a permanent 
feature. Studies of the last decade suggest that cell identity is determined by transcriptional 
and/or epigenetic actions and, more importantly, that it is subject to reprogramming by 
resetting intrinsic programs (Vogel 2010). The ability of a cell to dedifferentiate to a 
pluripotent state is critically dependent on the ability of that cell to re-enter cell cycle and 
replicate its genome. The molecular forces triggering change in cellular identity are not well 
understood. Injury may trigger programs of regeneration, which may involve changes in 
cell plasticity and identity. This principle may underline the injury-induced plasticity, 
lineage reprogramming and neurogenesis in the adult PNS. Therefore, studies of injury- 
induced neurogenesis in the context of neural plasticity will provide new insights into the 
complexity of DNA replication and cellular identity at the tissue level. 

5. Conclusion 

The review of the current literature dealing with DNA synthesis, neural proliferation and 
neurogenesis in the adult nervous system challenges Cajal's dogma that "In the adult 
centers, the nerve paths are something fixed, ended and immutable. Everything may die, 
nothing may be regenerated." Moreover, more and more studies show that brain is not the 
only place in the adult nervous system where new neurons could be generated. The 
mammalian CNS and PNS appear to have a greater regenerative ability than was previously 
believed. The factors that regulate proliferation and differentiation in the uninjured adult 
brain may contribute to increased DNA synthesis and proliferation after injury. In the adult 
nervous system, development is complete or near complete and new cells are not in high 
demand. After injury, however, the need and ability for production of new cells in the adult 
brain radically increases. This demand may recapitulate developmental mechanisms in the 
injured adult nervous system. Thus, factors that influence NSCs during development may 
be more likely to contribute to injury-induced neurogenesis than factors regulating NSCs in 
the uninjured adult brain. Recent advances in regenerative research show that injury may 
induce cascade of events forcing the pool of endogenous multipotent cells to enter the cell 
cycle and differentiate toward glia or neurons. What is even more exciting, recent studies 
strongly support the hypothesis that terminally differentiated cells can be stimulated to go 
back to less-differentiated stage within their own lineage (dedifferentiation) in response to 
injury. Dedifferentiation may even go a step further and regress to a point where neural 
cells may switch lineage (transdifferentiation). However, further studies are necessary to 
determine the specific intrinsic and micro environmental cues that drive the injured nervous 
system for adult neurogenesis. Harnessing the mechanisms involved in the induced 
neurogenesis in the adult nervous system could enable neural replacement therapy as a new 
approach in the regenerative medicine. 
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1. Introduction 

The aim of this chapter is to show the consequences of seqA gene disruption in Salmonella 
typhimurium. Of special interest is the observation that this disruption causes asynchrony of 
DNA replication. In fact, lack of SeqA protein causes major changes in the lipid profile of the 
membrane cell which is implicated in the initiation of DNA replication. Moreover, during 
passage through the small intestine in the host. Salmonella typhimurium faces periodic release of 
bile and due to the absence of SeqA protein the sensitivity to this bile increases. This unknown 
sensitivity could be due to an altered membrane composition of phospholipids and fatty acids. 
Deoxyribonucleic acid (DNA) contains all the information required to build the cells and 
tissues of a prokaryotic or an eukaryotic organism. The exact replication of this information 
in any species assures its genetic continuity from generation to generation and is critical to 
the normal development of an individual. The information stored in DNA is arranged in 
hereditary units, known as genes, that control identifiable traits of an organism. 

Discovery of the structure of DNA and subsequent elucidation of how DNA directs 
synthesis of RNA, which then directs assembly of proteins-the so-called central dogma-were 
monumental achievements marking the early days of molecular biology. However, the 
simplified representation of the central dogma as DNA—>RNA—>protein does not reflect the 
role of proteins in the synthesis of nucleic acids. Moreover, proteins are largely responsible 
for regulating DNA replication and gene expression, the entire process whereby the 
information encoded in DNA is decoded into the proteins that characterize various cell 
types. One of these proteins is the DNA-Binding Protein SeqA. 

1.1 SeqA: the DNA-binding protein 

SeqA was discovered in some prokaryotes as a protein involved in the methylation / 
hemimethylation cycle of Escherichia coli DNA (Lu et al., 1994). SeqA binds to 
hemimethylated GATC sites formed by DNA replication and regulates activation of the 
Escherichia coli chromosome replication origin (Lu et al. 1994). Proper chromosome 
segregation also requires SeqA (Bach et al, 2003). Furthermore, SeqA trails the DNA 
replication fork and may contribute to nucleoid organization in newly replicated DNA 
(Brendler et al, 2000; Klungsoyr & Skarstad, 2004; Lobner-Olesen et al, 2003; Yamazoe et al. 
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2005). Aside from its roles in chromosome replication and nucleoid segregation, SeqA is 
known to regulate the transcription of certain genes. In bacteriophage lambda, SeqA 
activates the p r promoter in a GATC methylation-dependent fashion. 

SeqA also acts as a transcriptional coactivator by facilitating binding of the ell transcription 
factor to the lambda pi and p a q promoters. Competition between SeqA and the OxyR 
repressor for hemimethylated GATC sites has been shown to regulate phase variation in the 
Escherichia coli agn43 gene (Prieto et ah, 2007). These examples raised the possibility that 
SeqA binding to critical GATC sites might likewise regulate the expression of prokaryotic 
genes like in Salmonella, which is a member of the E nterobacteriaceae family. 

1.2 The genus Salmonella 

Much of recent research is focused on Salmonella (figure 1) which causes diseases ranging 
from food and blood poisoning to typhoid fever and heart disease (Marinus, 1996). 



Fig. 1. Salmonella typhimurium 

Unlike its close relative Escherichia coli, Salmonella enterica is never encountered as a 
commensal in humans but is always associated with disease. Most Salmonella serovars 
capable of infecting humans cause gastro-intestinal disease and are spread via contaminated 
food and water. The serovar Typhi is unique in that, it is a strict human pathogen which 
gives rise to a severe systemic disease called typhoid fever. Salmonella enterica serovar 
Typhimurium (which will be referred to as Salmonella typhimurium throughout this chapter) 
causes a systemic disease in certain inbred mouse strains that share many similarities to 
typhoid fever in humans. The Salmonella infection is initiated upon ingestion of the bacteria. 
A small fraction of the bacteria survive the acidic environment in the stomach and establish 
an infection in the small intestine, where the Salmonella multiply and displace the normal 
flora bacteria. The bacteria then cross the intestinal epithelium by invading the M cells of the 
Peyer's patches and enter the blood circulation. Systemic disease in both humans and mice 
is associated with the capacity of the bacteria to survive and replicate in macrophages and in 
the late stages of the disease. Salmonella can be found in large numbers in the liver and 
spleen. The murine typhoid fever model has been used extensively to study the interactions 
between pathogenic bacteria and their host. 
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Salmonella typhimurium represents a concern with regard to food safety due to its ability to 
growth in a wide range of adverse environmental conditions. Moreover, Salmonella 
typhimurium which contaminates foods may be derived from environments in which its 
previous growth occurs at moderate acidic conditions and, consequently, it may develop 
adaptive responses which enhance its resistance to other stress conditions occurring during 
food processing (Foster & Hall, 1990; Leyer & Johnson, 1993; Tosun & Gonul, 2003). 

One of the most important and intensively studied stress responses in Salmonella 
typhimurium is the acid tolerance response which increases its subsequent ability to survive 
in high acid foods, as well as in the extreme acid conditions of the gastrointestinal tract, 
increasing the risk of illness (Greenacre et al., 2003; Leyer & Johnson, 1992; Waterman & 
Small, 1998; Yuk & Schneider, 2006). 

1.3 DNA replication in Salmonella typhimurium 

In Salmonella typhimurium, the initiation of replication of chromosomal DNA is coordinated 
with cell division. It was demonstrated that DNA replication in bacterial cells is initiated on 
membranes and the activities of replication proteins are regulated by membrane 
components (Jacob et al, 1963). Indeed, initiation of DNA replication is precisely regulated 
in the cell cycles (Messer & Weigel, 1996). The SeqA protein seems to be one of the key 
proteins in the control of this process (Lu et al., 1994; Wold et al., 1998). It is an inhibitor of 
the onset of Escherichia coli chromosome replication in vivo (Boye et al., 1996; Slater et al., 
1995) and, at high concentrations, of the replication initiator protein, DnaA in vitro, but it 
may stimulate replication at low DnaA concentrations in vitro (Wold et al., 1998). It affects 
DNA topology and inhibits open complex formation at the replication origin (Kang et al., 
2003; Torheim & Skarstad, 1999). It was demonstrated that SeqA limits in vivo DnaA activity 
in replication from the chromosomal origin oriC (Von Freiesleben et al., 1994). Moreover, 
SeqA protein is essential for sequestration, which affects oriC in the newly replicated 
hemimethylated state (Lu et al., 1994; Slater et al., 1995). Disruption of the sec\A gene of 
Salmonella typhimurium causes filament formation, aberrant nucleoid segregation, induction 
of the SOS response, envelope instability, and increased sensitivity to membrane-damaging 
agents like bile salt. These defects are similar to those described in Escherichia coli. Recent 
results (Prieto et al., 2007) indicated that lack of SeqA renders Salmonella enterica sensitive to 
sodium choleate (ox bile extract) but the cause of this sensitivity remains unknown. 

In the next sections, we show the importance of DNA sequestration by describing in detail 
the implication of SeqA in the replication synchrony. Then, we demonstrate the changes in 
the lipid profile of the membrane cell after seqA mutation and we expose explanations of 
increasing bile sensitivity of Salmonella typhimurium, in sec\A mutant. In final section, we 
summarize our paper and present the future directions of our research. 

2. DNA replication and membrane sequestration 

The initiation of chromosomal replication occurs only once during the cell cycle in both 
prokaryotes and eukaryotes. This initiation is the first and tightly controlled step of a DNA 
synthesis. 

Because much of what is known about the regulation of the initiation of bacterial 
chromosomal replication comes from studies of Escherichia coli, this review focuses mainly 
on regulatory mechanisms in Sahnonella typhimurium. 
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2.1 What is DNA sequestration by SeqA protein? 

Replication of the bacterial chromosomal DNA initiates only once, at a specific region 
known as the origin of chromosomal replication oriC, by the initiator protein DnaA. This 
protein interacts specifically with 9-bp non-palindromic sequences (DnaA boxes) that exists 
at oriC. To ensure that initiation at an origin occurs only once per cell cycle, specific 
mechanisms exist to control chromosomal replication. In one mechanism, the SeqA protein 
that is tightly bound to hemimethylated DNA by a mechanism known as sequestration and 
which recognizes GATC sequences overrepresented within oriC and prefers binding to 
hemimethylated over binding to fully or unmethylated oriC (Figure 2). 



Fig. 2. DNA sequestration by SeqA 

The chromosomal DNA is methylated at adenine residues in GATC sequences by Dam 
methylase. Following passage of the DNA replication fork, GATC sites methylated on the 
top and bottom strands in a mother cell (denoted as fully methylated) are converted into 
two hemimethylated DNA duplexes: one methylated on the top strand and nonmethylated 
on the bottom strand and one methylated on the bottom strand and nonmethylated on the 
top strand due to semi-conservative replication. Most GATC sites are rapidly remethylated 
by the enzyme DNA methyltransferase (Dam methylase or Dam) and exist in the 
hemimethylated state for only a fraction of the cell cycle (Figure 3). 

Exceptions are the DNA replication origin of Salmonella typhimurium, the dnaA promoter, 
and possibly additional GATC sites in the chromosome which bind SeqA. SeqA 
preferentially binds to clusters of two or more hemimethylated GATC sites spaced one to 
two helical turns apart (Figure 4). 

In the case of oriC, sequestration delays remethylation and prevents binding of the DnaA 
protein, which controls the initiation of DNA replication. At other sites, binding of SeqA 
tetramers to hemimethylated GATC sites may organize nucleoid domains. Notably, the 
transcription profile of a Salmonella typhimurium SeqA- mutant was found to be similar to 
that of a Dam overproducer strain. Based on this observation, a model was developed in 
which Dam and SeqA compete for binding to hemimethylated DNA generated at the 
replication fork. 








The Absence of the “GATC - 

Binding Protein SeqA” Affects DNA Replication in Salmonella enterica Serovar Typhimurium 


287 


CH, 

_Li_ 

GATC 
C T A G 

-1- 

CH 3 


ch 3 

_Li_ 

GATC 
C T A G 


t 

^ Dam 


Rapid 

- > 


ch 3 

_Li_ 

GATC 
C T A G 
I 

CH 3 


I 

GATC 
C T A G 

-1- 

CH 3 


ch 3 

_LT_ 

GATC 
C T A G 
I 

ch 3 


Fig. 3. The vast majority of chromosomal GATC sites are fully methylated until DNA 
replication generates two hemimethylated species, one methylated on the top strand and 
one methylated on the bottom strand. Within a short time after replication (less than 5 min). 
Dam methylates the nonmethylated GATC site, regenerating a fully methylated GATC site. 
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Fig. 4. Two or more helically phased GATC sites can be bound by SeqA when they are in the 
hemimethylated state. Binding of SeqA inhibits Dam methylation, maintaining the 
hemimethylated state for a portion of the cell cycle. Dissociation of SeqA allows Dam to 
methylate the hemimethylated DNAs, generating fully methylated DNA. 


2.2 Effects of seqA mutation on DNA replication 

As we said before, following the replication fork progression and the nascent strand 
synthesis, the daughter DNA becomes hemimethylated. SeqA protein binds to the 
hemimethylated GATC sequences (hemi-sites) and performs various roles to control the cell 
cycle progression. Immediately after the initiation of replication SeqA binds to the replicated 
oriC and sequesters it from remethylation and reinitiation of replication at the replicated 
oriC. SeqA tracks replication forks as a multiprotein complex and contributes to the 
maintenance of superhelicity and decatenation of daughter chromosomes through the 
stimulation of topoisomerase IV and results in a synchronous replication. 

When rounds of replication are allowed to run to completion, the number of chromosomes 
per cell is 2n (n = 0,1, 2, 3, etc). When initiations are asynchronous, as in dnciA (Ts) initiation 
mutants at the permissive temperature and in the Escherichia coli dam mutant (Boye & 
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Lobner-Olesen, 1990; Skarstad et al., 1988), the presence of a different number of 
chromosome equivalents (three, five, six, etc.) was detected by flow cytometry. The presence 
of cells containing a number of chromosomes different from 2n suggests that the seqA 
mutant has a defect in the synchrony of replication initiation. Wild type and seqA mutant of 
Salmonella typhimurium growing exponentially in glucose-casamino acid medium were 
treated with rifampicin and cephalexin, which block initiation of replication and cell 
division respectively. Wild-type cells initiated replication synchronously (number of 
chromosomes per cell is 2n). The appearance of cells with chromosome numbers other than 
2n indicates a moderate asynchrony of initiation. So, flow cytometer analysis of our seqA 
mutants has shown that replication initiation is asynchronous and can occur throughout the 
cell cycle, not only at the normal cell age for initiation. The most likely reason for this 
asynchrony phenotype is that secondary initiations occurred at newly replicated origins in 
seqA mutants, due to lack of sequestration and inadequate methylation. We showed that 
initiation synchrony was dependent on intact GATC methylation sites. This loss of 
synchrony affected culture growth rates and cell size distributions only slightly and suggest 
that seqA mutants have a slight defect in synchronizing replication initiation. All these 
results suggest that DNA sequestration plays a role in preventing the occurrence of multiple 
initiations at a single origin in the same replication cycle. However, using flow cytometry, 
we found that the asynchrony of initiation, which is one of the phenotypes of the seqA 
mutation, was returned to almost normal in a seqA null mutant harboring the wild-type 
seqA gene under the control of a tac promoter. 

The OFF- to ON-phase rate was reduced in a seqA mutant, but much of this effect could be 
accounted for by a reduction in the Dam/DNA ratio caused by increased asynchronous 
initiation of DNA replication that occurs in the absence of SeqA, which normally sequesters 
oriC and plays a critical role in timing of DNA replication (Bogan & Helmstetter, 1997). 

3. Membrane instability after seqA disruption 

The origin of replication, oriC, is highly enriched in GATC sequences, which are sites for 
methylation by Dam methylase. Semi-conservative replication of fully methylated DNA 
generates hemimethylated oriC sites. 

3.1 Membrane sequestration hemimethylated of oriC 

Early studies demonstrated that membranes are capable of binding to hemimethylated oriC 
in vitro and in vivo , but not to fully methylated or unmethylated oriC (Ogden et al., 1988). 
While they are sequestered at the membrane, the recently replicated origins are unavailable 
for reinitiation and are protected from methylation by Dam methylase for an extended 
period. The origins remain sequestered until conditions in the cell are no longer in a state 
supportive for initiation (Figure 5). 

Prior to initiation of DNA replication. Dam methylase sites are fully methylated. 
Immediately following replication, the newly synthesized strand is unmethylated, and the 
resulting hemimethylated origin is sequestered at the at lipid bilayer of membrane by SeqA. 
This is not accessible to replicatively active ATP-DnaA. After approximately one-third of 
the cell cycle, the sequestered origin is released and methylated by Dam methylase. At this 
point in the cell cycle, the levels of ATP-DnaA are not sufficient to catalyze a new round of 
replication. As such, sequestration serves as a mechanism to prevent secondary initiations. 
Subsequent work identified SeqA protein to be an essential factor for oriC sequestration. 
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Fig. 5. Membrane sequestration of recently replicated origins. 

Even though the first steps of SeqA purificiation involve liberating SeqA from the 
membrane fraction of cell lysates by treatment with high concentrations of salt and 
sonication, the primary sequence for SeqA protein does not suggest any obvious membrane¬ 
associating domains. This is supported by the crystal structure of the C-terminal DNA- 
binding domain, and by biochemical studies that show that the N-terminal domain serves in 
the aggregation of SeqA protein into functional homotetramers (Guarne et al., 2002). Yet, 
there is some evidence that SeqA has an association with membranes (d'Alencon et al., 1999; 
Wegrzyn et al., 1999). The original data that newly replicated, hemimethylated origins are 
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sequestered at the membrane hold true. Whether membrane sequestration of oriC occurs 
directly through the SeqA protein or through an as yet unidentified factor remains unclear. 
We speculated that the examination of fatty acids composition and phospholipids fractions 
in wild type and seqA mutants would provide useful information to understand the 
interaction between SeqA protein and bacterial membrane in Salmonella typhimurium. 

3.2 Effects of seqA mutation on membrane lipids 

The coordination of the synchronization of the replication initiation, the activation of the 
DnaA protein at oriC, and the cellular cycle suggested the existence of a very narrow 
interaction between the bacterial membrane lipids and the SeqA protein (Landoulsi et al., 
1990). Acidic phospholipids, such as cardiolipin and phosphatidylglycerol, decrease the 
affinity of adenine nucleotide for DnaA protein (Mizushima et al., 1997; Sekimizu & 
Kornberg, 1988). Thus, it has been proposed that phospholipids regulate the activity of 
DnaA protein in cells and in vitro (Makise et al., 2002; Sekimizu & Kornberg, 1988). It has 
been demonstrated that the sec\A mutation can overcome the incompatibility phenotype 
observed between the chromosomal oriC and minichromosomal oriC copies in the dam 
mutant strain (Lobner-Olesen & Von Freiesleben, 1996). The mutation in the seqA gene 
allows efficient transformation of fully methylated minichromosomes into dam mutant cells 
(Lu et al., 1994; Von Freiesleben et al., 1994). We can suggest a possible interaction between 
the activities of SeqA protein and membrane lipids. 

We analyzed the phospholipids and the fatty acids composition of the bacterial membrane 
with the aim of correlating the membrane structure variation in this lipids with seqA gene 
mutation. 

The Phospholipids extracted from the bacterial membrane were separated and identified by 
thin layer chromatography. The content of each phospholipid was calculated from the fatty 
acids contents measured by the capillary gas chromatography method and is reported in the 
following section. The phospholipids found were phosphatidylethanolamine, 
phosphatidylglycerol, and cardiolipin on the basis of the following criteria: 

• Identity of chromatographic behavior in thin layer chromatography with synthetic and 
purified commercial phospholipids from various sources; 

• The flow rate is the same as that of commercial phospholipids; and 

• The phospholipids are the same as those reported by several authors and works (Ames, 
1968). 

• Phospholipids composition of Salmonella typhimurium wild type membrane 

The major phospholipids present in Sahnonella typhimurium wild type strain membrane 
were phosphatidylethanolamine, accounting for about 75.2%, followed by 
phosphatidylglycerol and cardiolipin (19.4% and 5.3%, respectively) (Figure 6.a). These 
phospholipids distributions agreed very closely to those reported in the literature for 
Sahnonella typhimurium (Ames, 1968). 

• Phospholipids composition of the Sahnonella typhimurium seqA mutant membrane 
Phosphatidylethanolamine, phosphatidylglycerol, and cardiolipin proportions were 
affected by the seqA mutation while comparing them with the wild type strain. In the 
seqA mutant, the zwitterionic phosphatidylethanolamine fraction decreased from 75.2% 
to 20.53%. However, the acidic phospholipid fractions (phosphatidylglycerol and 
cardiolipin) becomes a majority of total phospholipids with 79.47%, distributed in 
70.6% of phosphatidylglycerol and 8.8% of cardiolipin (Figure 6,b). 
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Fig. 6. Comparative analysis of percentage of phospholipids levels in wild type (a) and seqA 
(b) Salmonella typhimurium strains. Their contents were calculated from the fatty acid 
contents measured by the capillary gas chromatography method. Average values of 
triplicates were given, and the deviation was less than 5% of each value (significance was 
assessed using the Student's f-test). 

The membrane fatty acid composition of the Salmonella typhimurium wild type strain 
was determined by the capillary gas chromatography method. Many fatty acids were 
found and seven main peaks were identified by comparing their retention times with 
those of known standards. Three saturated fatty acids were tetradecanoic (myristic) acid 
(Ci4:o), hexadecanoic (palmitic) acid (Ci6 : o), and octadecanoic (stearic) acid (Cmo), two 
monounsaturated fatty acids were hexadecenoic (palmitoleic) acid (C 1 & 1 W 7 ) and 
octadecenoic (oleic or vaccenic) acid (Ci8:iw9), and two cyclic fatty acids were the cis- 
9,10-methylenehexadecanoic acid (cycm) and the cis-9,10-methyleneoctadecanoic 
(lactobacillic) acid (cycw). Their relative percentages were between 2% and 46% 
corresponding to more than 96% of all fatty acids observed. Some other minor fatty 
acids were also detected at lower relative concentrations: Cmo, Ci 8 : 2 w 6 , Ci8:3w6, Ci8:3w3, 
Cl9:0, and C20:0- 

• Fatty acid composition of Salmonella typhimurium wild type membrane 

In the wild type strain, Ci6:o, Ci 6 :i W 7 , and Ci8 : o were the main constituents, 
representing about 60% of total fatty acids. The proportion of total lipid cyclic fatty 
acids obtained was about 25.79%. However, minimum cyclic fatty acids levels were 
observed for phosphatidylethanolamine and Cardiolipin (6.43% and 5.08%, 
respectively) and higher one for phosphatidylglycerol (38.35%). The unsaturated to 
saturated fatty acids ratio was in the majority with respect to between the 
phospholipid fractions (table 1). 
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Fatty acids 

Total lipids Phosphatidylethanolamine Phosphatidylglycerol 

Cardiolipin 

Cl4:0 

4.20 ± 0.09 

4.75 ± 0.1 

4.2 ± 0.005 

11.27 ±0.13 

Cl6:0 

46.61 ± 0.22 

53.16 ± 0.26 

35.68 ± 0.61 

65.2 ±1.04 

Cl6:l w7 

7.16 ± 0.47 

8.54 ± 0.31 

5.43 ± 0.77 

7.04 ± 0.06 

cyciy 

7.53 ± 0.6 

6.43 ± 0.24 

10.70 ± 0.21 

2 ± 0.51 

Cl8:0 

8.51 ± 0.91 

1.13 ± 0.04 

8.80 ± 0.08 

5.90 ± 0.09 

Cl8:l w9 

2.07 ± 0.01 

2.45 ± 0.21 

3.28 ± 0.31 

2.24 ± 0.04 

cyci 9 

18.26 ± 0.38 

19.17 ±0.63 

27.65 ± 0.59 

3.08 ± 0.82 

MUFA 

5.66 

4.37 

4.26 

3.27 

XSFA 

59.32 

59.04 

48.68 

82.37 

XUFA 

9.23 

10.99 

8.71 

9.28 

XCFA 

25.79 

25.6 

38.35 

5.08 

UFA/SFA 

0.155 

0.186 

0.179 

0.113 


Table 1. Membrane fatty acid composition (molar percent) in total lipids and different 
phospholipid classes in the Salmonella typhimurium wild type strain (MUFA: 
Monounsaturated fatty acids; SFA: Saturated fatty acids; UFA: Unsaturated fatty acids; 

CFA: Cyclic fatty acids; UFA/SFA: Unsaturated to saturated ratio). 

• Fatty acid composition of the Sahnonella typhimurium seqA mutant membrane 

To determine whether the mutation in the seqA gene affected membrane lipid 
components, fatty acid composition was quantified. Our results indicated that the fatty 
acid composition of the total lipids appeared to be unaffected by the seqA mutation 
(table 2). The loss of cardiolipin and phosphatidylethanolamine was accompanied with 
a decrease in the proportion of Ci 4 :o, Ci6 : o, and Ci 6 :i W 7 and an increase in the proportion 
of Ci8:o especially for the cardiolipin phospholipid (from 5.9% to 41.93%). Compared 
with the isogenic wild type strain, cardiolipin and phosphatidylethanolamine 
phospholipids showed an increase in the percentages of cycw and a decrease in their 
Ci 8 :iw 9 , which resulted in low level of acyl chain unsaturation of fatty acids (table 2). The 
phosphatidylglycerol fraction showed a great increase of both Cmo and cyci7 and a 
decrease in Ci 6 :i W 7 , which resulted in a low unsaturated to saturated fatty acids ratio 
(table 2). 

Various physiological and biochemical changes took place as a consequence of many gene 
mutations, which can lead to numerous damages in the structure and function of the 
membrane cells (Shibuya et al., 1985; Taylor & Cronan, 1976). The purpose of the work 
presented in this section was to investigate a possible connection between both the seqA 
gene (coding for the sequestration protein SeqA) and some membrane components in 
Sahnonella typhimurium. The phospholipids and fatty acids were the object of attention 
because the membrane delimiting the cell, and presumably playing a key role in DNA 
replication, is supposed to be constituted largely of lipids. Interactions of SeqA protein with 
cellular membranes have been previously reported. However, although regulation of the 
activities of this protein by membranes or their components was reported (Oshima et al., 
2002) or suggested (Slater et al, 1995; Wegrzyn et al, 1999), little is known about the 
influence of SeqA on the composition of Sahnonella typhimurium cell membranes. So, we 
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suggest that in addition to its direct role in the sequestration of oriC region of the 
chromosome on the membrane, SeqA could activate or impair the expression of some genes 
(e.g., STM1329: putative inner membrane protein and yijP\ putative integral membrane 
protein, respectively) that interact with lipid metabolism and regulate acidic phospholipids 
synthesis. 


Fatty acids 

Total lipids Phosphatidylethanolamine Phosphatidylglycerol 

Cardiolipin 

Cl4:0 

3.74 ± 0.18 

4.19 ± 0.25 

2.77 ± 0.21 

4.85 ± 0.87 

Cl6:0 

42.81 ± 0.14 

45.8 ± 0.32 

54.25 ± 0.13 

36.38 ± 0.38 

Cl6:l w7 

4.24 ± 0.37 

2.53 ± 0.57 

1.38 ± 0.02 

4.13 ± 0.24 

cyci7 

11.6 ±0.11 

20.63 ± 0.13 

13.70 ± 0.08 

0.35 ± 0.05 

Cl8:0 

12.96 ± 1.06 

1.15 ± 0.17 

4.08 ± 0.89 

41.93 ± 0.01 

Cl8:l w9 

1.5 ± 0.12 

0.40 ± 0.18 

0.45 ± 0.07 

0.000 

cyc 19 

20.7 ± 0.27 

24.11 ± 1.06 

22.42 ± 0.98 

9.77 ± 0.49 

MUFA 

2.45 

1.19 

0.95 

2.59 

XSFA 

59.51 

51.14 

61.10 

83.16 

XUFA 

5.74 

2.93 

1.83 

4.13 

ICFA 

32.30 

44.74 

36.12 

10.12 

UFA/SFA 

0.096 

0.057 

0.030 

0.050 


Table 2. Membrane fatty acid composition (molar percent) in total lipids and different 
phospholipid classes in the Salmonella typhimurium seqA mutant strain. 


4. Increased sensitivity of membrane to bile salt after seqA disruption 

The overall purpose of this last section was to study the modifications of the cell membrane 
compounds of Salmonella typhimurium during the growth in the presence of ox bile doses. 

The results obtained evidenced that the tested substances induced noticeable modifications 
of the phospholipids and fatty acids composition of cell membrane during bacterial growth. 

4.1 Phospholipids composition of bile treated seqA mutants 

Exposed to the ox bile, and compared with the non treated cells, our results indicated that 
the phospholipids composition of the bile treated wild type strain was in the majority with 
respect to the non treated wild type strain. So it appeared to be unaffected by the ox bile 
stress. The acidic phospholipid fractions (phosphatidylglycerol and Cardiolipin) account for 
25.4% of total Phis distributed in 20.7% of phosphatidylglycerol and 4.7% of Cardiolipin. A 
non significant decrease in the phosphatidylethanolamine fraction (74.6%) was observed. 

To evaluate the combined effects of the seqA mutation and the ox bile stress on the bacterial 
membrane integrity, we compared the phospholipids composition of the exposed seqA 
mutant with the non exposed seqA and wild type strains. Compared with these tow 
Salmonella typhimurium strains, the acidic phospholipids (phosphatidylglycerol and 
Cardiolipin) showed a great increase with 81.7% and 14.1%, respectively. However, the 
phosphatidylethanolamine proportion decreased dramatically to 4.2% (Figure 7). 
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Fig. 7. Comparative analysis of phospholipid levels in Salmonella typhimurium wild type and 
secjA strains (control and exposed to ox bile). Exponentially growing wild type strain was 
incubated at 37°C with ox bile extract at a concentration of one percent. Their contents were 
also calculated from the fatty acids contents. Average values of triplicates were given, and 
the deviation was less than 5% of each value (significance was assessed using the Student's 
f-test). 

4.2 Effect of the ox bile combined to the seqA mutation on membrane fatty acids 
composition of Salmonella typhimurium 

The membrane fatty acids composition (molar percent) of the wild type and seqA mutant 
strains exposed to ox bile is shown in table 3. 

• Membrane fatty acids cofnposition of the wild type strains exposed to the ox bile. 

For the wild type strain cultured with the ox bile, no significant changes were observed 
in both total lipids and phospholipids (cardiolipin, phosphatidylglycerol, and 
phosphatidylethanolamine). The fatty acids composition appeared to be unaffected by 
the ox bile stress with an unsaturated to saturated ratio, in the majority, with respect to 
that of wild type control strain. 

• Membrane fatty acids composition of the seqA mutant strains exposed to the ox bile. 

To determine whether the mutation in the seqA gene added to the ox bile stress affected 
membrane lipid components, fatty acids composition was quantified. The membrane fatty 
acids composition of the total lipids was highly affected by the ox bile stress (table 3). 

The fatty acids were characterized by low level of cyclic fatty acids, representing about 
22.06% of total content, and high level of unsaturated fatty acids, representing about 15.74% 
(25.79% / 9.23% and 32.30% / 5.74%, respectively, for the wild type and the seqA mutant 
strains). These changes were due to a decrease in the cyclopropane derivatives C17- and C19- 
cyclic fatty acids and a concomitant increase in the unsaturated fatty acids (Ci6:iw7 and 
Ci 8 :iw 9 ) and resulted in a high unsaturated to saturated ratio (table 3). The accumulation of 
the cardiolipin fraction was accompanied with an increase in the Ci 8 :iw 9 composition, which 
rise up from 0.0% to 3.0% (table 3). In the phosphatidylglycerol fraction, data confirmed that 
also Ci 8 :iw 9 becomes a prominent species accounting for about 4.25% (3.28% and 0.45% 
respectively for the wild type and seqA phosphatidylglycerol fractions). Finally, we noticed 
that the phosphatidylethanolamine phospholipids were characterized by a reduction in 
cyclic fatty acids (cyci7 and cycw) to the profile of their unsaturated fatty acids derivatives 
(Ci 6 :iw 7 and Ci 8 : i W 9 )- These phospholipids changes resulted in a high unsaturated to 
saturated ratio (table 3). 

It has been proposed that intracellular pathogens like Salmonella are exposed to several 
stressing agents such as bile during the infection process. Stress conditions which 
pathogenic pathogen encounter during infection course can affect membrane components. 
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Fatty acids 

Total lipids 

Phosphatidylethanolamine Phosphatidylglycerol 

Cardiolipin 

Cl4:0 

4.78 ± 0.04 

5.15 ± 0.90 

2.80 ± 0.21 

5.42 ± 0.06 

Cl6:0 

44.31 ± 0.56 

46.38 ± 0.22 

56.01 ± 0.03 

37.38 ± 0.03 

Cl6:l w7 

9.89 ± 0.09 

4.36 ± 0.07 

3.18 ± 0.42 

6.13 ± 0.24 

CyCi7 

5.96 ± 0.44 

17.03 ± 0.06 

11.16 ± 0.18 

0.09 ± 0.83 

Cl8:0 

13.11 ± 0.03 

2.84 ± 0.83 

4.38 ± 0.61 

42 ± 0.01 

Cl8:l w9 

5.85 ± 0.52 

4.20 ± 0.03 

4.25 ± 0.07 

3 ± 0.54 

cyci 9 

16.1 ± 0.07 

20.04 ± 0.36 

18.22 ± 0.46 

5.98 ± 0.07 

MUFA 

3.06 

2.74 

1.75 

3.43 

IS FA 

62.2 

54.37 

63.19 

84.8 

IUFA 

15.74 

8.56 

7.43 

9.13 

ICFA 

22.06 

37.07 

29.38 

6.07 

UFA/SFA 

0.253 

0.157 

0.117 

0.107 


Table 3. Membrane fatty acid composition (molar percent) in total lipids and different 
phospholipid classes in the Salmonella typhimurium sec\A mutant strain exposed to ox bile. 


The environmental control of regulatory mechanisms is mediated by complex processes. 
Salmonella comes in contact with bile salts in the intestine and it is able to resist the action of 
bile and respond to escalating bile concentrations by increasing mechanisms of resistance. 
Previous studies showed that bacteria with enhanced tolerance to acid, bile and blood 
serum survive (Morgan et al., 1986, Wilmes-Riesenberg et ah, 1996) and cause disease 
(Foster & Hall, 1990; Rowbury et al., 1989) better than sensitive bacteria and showed that in 
vitro acid adapted Salmonella were more resistant towards bile (Velkinburg & Gunn, 1999) 
and acids (Foster & Hall, 1990) in comparison to non-adapted cells. 

Results obtained in this study show that the ox bile stress added to the seqA mutation is an 
important factor affecting Salmonella typhimurium resistance and could contribute to find 
new strategies based on intelligent combinations of hurdles, which could prevent the 
development or survival of Salmonella spp. in gastrointestinal tract. Salmonella typhimurium 
cells have developed efficient protection systems to cope with a variety of physicochemical 
unfavorable conditions and to adapt to the environmental stresses. In particular, 
fundamental for the microbial cells is to maintain membrane integrity and functionality in 
response to environmental stresses encountered during infection. In response to stresses, the 
phospholipids can alter their acyl chain structure by changing the ratio of saturation to 
unsaturation, cis to trails unsaturation, branched to unbranched structure and type of 
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branching and acyl chain length (Russel, 1984). Different modulation mechanisms can be 
used in relation to the physiological state of the cells (Rock & Cronan, 1996). 

In a previous work Prieto et al., (2007) have shown that the seqA mutation renders Salmonella 
enterica sensitive to agents known to be antimicrobially active in the host like sodium 
choleate (ox bile extract) but the cause of this sensitivity remains unknown. In the present 
study we tried to explain the reasons and to investigate a possible connection between both 
the seqA gene (coding for the sequestration protein SeqA) and some membrane components 
and the sensitivity to the ox bile observed in Salmonella typhimurium. 

In Escherichia coli, seqA mutants show altered membrane permeability (Wegrzyn et al., 1999) 
and abnormal phospholipids composition, which may explain their increased sensitivity to 
a number of dyes. The observation that the envelope of Salmonella enterica seqA mutants is 
likewise unstable (Aloui et al., 2010) can be tentatively correlated with bile sensitivity, 
because unconjugated bile salts can enter the cell by diffusion (Thanassi et al., 1997). Thus, a 
structural role of SeqA in envelope stability cannot be discounted (Wegrzyn et al., 1999). An 
alternative explanation is that SeqA might regulate the expression of genes involved in the 
stability and the integrity of the cell membrane against bile salts during infection process, a 
possibility also considered in Escherichia coli (Strzelczyk et al., 2003). 

In summary. Salmonella typhimurium SeqA protein is required for maintenance of membrane 
integrity against the ox bile. Mutation in the seqA gene causes envelope defects and 
enhances sensitivity to the ox bile which together may contribute to the attenuation of 
virulence and may induce strong immune responses in infected animals. Recently, it has 
been demonstrated that Salmonella typhimurium lacking seqA gene exhibit a decrease of 
virulence in mice perhaps due to the bile sensitivity during the infection process. So we 
suggest that this mutant may be applied to the design of a live vaccine. 

5. Conclusion 

In the Salmonella typhimurium cell, DNA sequestration modulates a variety of processes such 
as DNA replication and transcription of certain genes. Deletion of the seqA gene produces a 
variety of phenotypes ranging from replication asynchrony to virulence attenuation, 
indicating multiple functions for the GATC-binding protein in modulating gene expression, 
proper chromosome segregation, initiation of chromosome replication, and nucleoid 
stabilization. Given these multiple roles, it is not surprising that seqA mutation is highly 
pleiotropic. However, the lack of SeqA protein does not impair viability. Sahnonella 
typhimurium seqA- strain described here lacks binding of SeqA to GATC sequences and is 
more sensitive to this mutation than the wild type which shows the inverse. In addition, no 
great difference between the seqA mutant of Salmonella typhimurium and those of some 
enterobacterial species such as Escherichia coli was observed with replication asynchrony or 
alteration membrane. In conclusion, the role of SeqA in the prokaryotic cellular processes 
such as the DNA replication and lipids membrane metabolism is clear. So it may rely on its 
capacity as a global regulator of the gene expression during bacterial life, in vitro , in a 
similar manner as it does in vivo. 

6. Future research 

Our knowledge on the effects of SeqA protein in Sahnonella typhimurium has considerably 
improved in the last decade. This fundamental research has several implications that will 
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prove to be useful for the development of novel therapeutic approaches. But, to date, 
therapeutic applications are still in their early experimental phases, but several recent 
studies provide promising results for future clinical developments. Over the last few years, 
many studies have demonstrated that Salmonella typhimurium seqA mutants exhibit 
asynchronous DNA replication and are highly attenuated for virulence in mice and have 
been proposed as live vaccines. These results prove that GATC-binding sites might have a 
role in regulating virulence of Salmonella typhimurium and perhaps in other related bacteria. 
In addition, future research must focus on the study of the decreasing virulence and the 
proteomic and enzymatic activities of a seqA mutant strain. So this perspectives can be 
useful to more fully understand the significance of the results obtained above. Of special 
interests are: firstly, the growing list of genes governed by DNA sequestration in bacterial 
pathogens ; secondly, the finding of novel genes regulated by SeqA protein using high 
throughput analysis, and, thirdly, the evidence that this protein may regulate the expression 
of many unidentified genes involved in DNA replication and membrane metabolism. 

Finally, the way in which SeqA participates clearly in the DNA replication and in the 
membrane integrity is a critical question that deserves further investigation in the near 
future, and may be research studies will have to identify explanations. 
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